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METHOD AND POLYNUCLEOTIDES FOR DETERMINING 
TRANSNATIONAL EFFICIENCY OF A CODON 

FIELD OF THE INVENTION 

THIS INVENTION relates generally to gene expression and 
5 in particular, to a method and polynucleotides for 
determining codon utilization in particular cells or tissues 
of an organism. More particularly, the method and 

polynucleotides of the invention are concerned with 
ascertaining codon preferences in cells or tissues for the 
10 purpose of modifying the translational efficiency of protein- 
encoding polynucleotides in those cells or tissues. 

BACKGROUND OF THE INVENTION 

It is well known that a " triplet" codon of four 
possible nucleotide bases can exist in 64 variant forms. 

15 These forms provide the message for only 20 different amino 
acids (as well as translation initiation and termination) and 
this means that some amino acids can be encoded by more than 
one codon. Some amino acids have as many as six " redundant", 
alternative codons while some others have a single, required 

20 codon. 

For reasons not completely understood, codon 
utilization is highly biased in that alternative codons are 
not at all uniformly present in the endogenous DNA of 
differing cell types. In this regard, there appears to exist 
25 a variable natural hierarchy of "preference" for certain 
codons between different cell types or between different 
organisms . 

Codon usage patterns have been shown to correlate with 
relative abundance of isoaccepting transfer RNA (iso-tRNA) 
30 species, and with genes encoding proteins of high versus low 
abundance. Moreover, the present inventors recently 
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discovered that the intracellular abundance of different iso- 
tRNAs varies in different cells or tissues of a single multi- 
cellular organism (see copending International Application 
No. PCT/AU98/00530) . 

5 The implications of codon preference phenomena on gene 

expression are manifest in that these phenomena can affect 
the translational efficiency of messenger RNA (mRNA) . It is 
widely known in this regard that translation of "rare 
codons" , for which the corresponding iso-tRNA is in 
10 relatively low abundance, may cause a ribosome to pause 
during translation which can lead to a failure to complete a 
nascent polypeptide chain and an uncoupling of transcription 
and translation. 

A primary goal in recombinant research is to provide 
15 transgenic organisms with expression of a foreign gene in an 
amount sufficient to confer the. desired phenotype to the 
organism. However, expression of the foreign gene may be- 
severely impeded if a particular host cell of the organism or 
the organism itself has a low abundance of iso-tRNAs 
20 corresponding to one or more codons of the foreign gene. 

Accordingly, a major aim of investigators in this field is to 
first ascertain the codon preference for particular cells or 
tissues in which a foreign gene' is to be expressed, and to 
subsequently alter the codon composition of the foreign gene 
25 for optimized expression in those cells or tissues. 

Codon preference may be determined simply by analyzing 
the frequency at which codons are used by genes expressed in 
a particular cell or tissue or in a plurality of cells or 
tissues of a given organism. Codon frequency tables as well 
30 as suitable methods for determining frequency of codon usage 
in an organism are described, for example, in an article by 
Sharp et al (1988, Nucleic Acids Res. 16 8207-8211). The 
relative level of gene expression (e.g., detectable protein 
expression Vs no detectable protein expression) can provide 
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an indirect measure of the relative abundance of specific 
iso-tRNAs expressed in different cells or tissues. 

Alternatively, codon preference may be determined by 
measuring the relative intracellular abundance of different 
5 iso-tRNA species. For example, reference may be made to 
copending International Application No. PCT/AU98 /00530 that 
describes a method that utilizes labeled oligonucleotides 
specific for different iso-tRNAs to probe an RNA extract 
prepared from a particular cell or tissue source. 

10 The above methods provide useful indirect evidence for 

determining codon preference. However, such indirect 

evidence may not provide an accurate indication of the 
translational efficiency of a given codon. Accordingly, 
there is a need to provide a method that more directly 

15 ascertains the translational efficiency of a codon in a cell 
or tissue. 

fiTTMMAKY OF TU R T TJVF.KFT T ON 

In one aspect of the invention, there is provided a 
method for determining the translational efficiency of an 
20 individual codon in a cell, said method comprising: 

- introducing into said cell a synthetic construct 
comprising a reporter polynucleotide fused in frame with a 
tandem repeat of said individual codon, wherein said 
reporter polynucleotide encodes a reporter protein, and 

25 wherein said synthetic construct is operably linked to a 

regulatory polynucleotide; and 

- measuring expression of said reporter protein in 
said cell to determine the translational efficiency of said 
codon . 

30 Preferably, the method further comprises comparing: 
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- expression of said reporter protein in a cell to 
which a synthetic construct comprising a tandem repeat of- 
said individual codon was provided; and 

- expression of said reporter protein in a cell to 
5 which a synthetic construct comprising a tandem repeat of 

another individual codon was provided; 
to thereby determine the relative translational efficiency of 
said individual codons in said cell. 

Suitably, the method further comprises comparing: 

10 - expression of said reporter protein in a cell to 

which a synthetic construct comprising a tandem repeat of 
said individual codon was provided; and 

- expression of said reporter protein in another cell 
to which a synthetic construct comprising a tandem repeat 

15 of said individual codon was provided; 

to thereby determine the translational efficiency of said 
individual codon in said cell relative to said other cell. 

Preferably, the method further comprises: 

- introducing the synthetic construct into a 
20 progenitor cell of said cell; and 

- producing said cell from said progenitor cell; 
wherein said cell contains said synthetic construct. 
Suitably, the method further comprises: 

- introducing the synthetic construct into a 
25 progenitor of said cell; and 

- growing an organism or part thereof from said 
progenitor cell; 

wherein said organism comprises said cell containing 
said synthetic construct. 
30 Suitably, the method further comprises: 
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- introducing the synthetic construct into an organism 
or part thereof such that said synthetic construct is 
introduced into said cell. 

In another aspect, the invention resides in a synthetic 
5 construct comprising a reporter polynucleotide fused in frame 
with a tandem repeat of individual codons, wherein said 
reporter polynucleotide encodes a reporter protein, and 
wherein said synthetic construct is operably linked to a 
regulatory polynucleotide. • 

10 In yet another aspect of the invention, there is 

provided an improved method of constructing a synthetic 
polynucleotide from which a protein is selectively 
expressible in a target cell of an organism, relative to 
another cell of the organism, said method comprising: 

15 - selecting a first codon of a parent polynucleotide 

for replacement with a synonymous codon which has a higher 
translational efficiency in said target cell than in said 
other cell; and 

- replacing said first codon with said synonymous 
20 codon to form said synthetic polynucleotide, wherein said 

first codon and said synonymous codon are selected by: 

- comparing translational efficiencies of 
individual codons in said target cell relative to said 
other cell using the method broadly described above; 

25 and 

- selecting said first codon and said synonymous 
codon based on said comparison. 

Preferably, said synonymous codon corresponds to a 
reporter construct from which the reporter protein is 
30 expressed in said target cell at a. level that is at least 
110%, preferably at least 200%, more preferably at least 
500%, and most preferably at least 1000%, of that expressed 
from said reporter construct in said other cell. 
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In a further aspect, the invention provides an improved 
method of constructing a synthetic polynucleotide from which 
a protein is expressible in a target cell of an organism at a 
higher level than from a parent polynucleotide encoding said 
protein, said method comprising: 

- selecting a first codon of the parent polynucleotide 
for replacement with a synonymous codon which has a higher 
translational efficiency in said target cell than said 
first codon; 

- replacing said first codon with said synonymous 
codon to form said synthetic polynucleotide, wherein said 
first codon and said synonymous codon are selected by: 

- comparing translational efficiencies of 
different individual codons in said target cell using 
the method broadly described above; and 

- selecting said first codon and said synonymous 
codon based on said comparison. 

Suitably, said synonymous codon corresponds to a 
reporter construct from which the reporter protein is 
expressed in said target cell at ^a level that is at least 
110%, preferably at least 200%, more preferably at least 
500%, and most preferably at least 1000%, of that expressed 
from the different reporter construct corresponding to said 
first codon. 

DETAILED DESCRIPTION 

1 . Definitions 

The articles "a" and "an" are used herein to refer to 
one or to more than one (i.e., to at least one) of the 
grammatical object of the article. By way of example, "an 
element" means one element or more than one element. 
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Throughout this specification, unless the context 
requires otherwise, the words "comprise" , "comprises" and 
"comprising" will be understood to imply the inclusion of a 
stated step or element or group of steps or elements but not 
the exclusion of any other step or element or group of steps 
or elements. 

By "expressible" is meant expression of a protein to a 
level sufficient to effect a particular function associated 
with the protein. By contrast, the terms "not expressible" 
and "not substantially expressible" as used interchangeably 
herein refers to (a) no expression of a protein, (b) 
expression of a protein to a level that is not sufficient to 
effect a particular function associated with the protein, (c) 
expression of a protein, which cannot be detected by a 
monoclonal antibody specific for the protein, or (d) 
expression of a protein, which is less that 1% of the level 
expressed in a wild-type cell that normally expresses the 
protein . 

By "expressing said synthetic construct" is meant 
transcribing the synthetic construct such that mRNA is 
produced. 

By "expression vector" is meant any autonomous genetic 
element capable of directing the synthesis of a protein 
encoded by the vector. Such expression vectors are known by 
practitioners in the art. 

As used herein, the term "function" refers to a 
biological, enzymatic, or therapeutic function. 

By "highly expressed genes" is meant genes that express 
high levels of mRNA, and preferably high level of protein, 
relative to other genes. 

By "isoaccepting transfer RNA" or "iso-tRNA" is meant 
one or more transfer RNA molecules that differ in their 
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anticodon nucleotide sequence but are specific for the same 
amino acid. 

By "natural gene" is meant a gene that naturally 
encodes the protein. However, it is possible that the parent 
5 polynucleotide encodes a protein that is not naturally- 
occurring but has been engineered using recombinant 
techniques . 

The term "non-cycling cell" as used herein refers to a 
cell that has withdrawn from the cell cycle and has entered 

10 the GO state. In this state, it is known that transcription 
of endogenous genes and protein translation are at 
substantially reduced levels compared to phases of the cell 
cycle, namely Gl, S, G2 and M. By contrast, the term 
"cycling cell" as used herein refers to a cell, which is in 

15 one of the above phases of the cell cycle. 

By "obtained from" is meant that a sample such as, for 
example, a polynucleotide extract or polypeptide extract is 
isolated from, or derived from, a particular source of the 
host. For example, the extract can be obtained from a tissue 
20 or a biological fluid isolated directly from the host. 

The term "oligonucleotide" as used herein refers to a 
polymer composed of a multiplicity of nucleotide residues 
(deoxyribonucleotides or ribonucleotides, or related 
structural variants or synthetic analogues thereof) linked 

25 via phosphodiester bonds (or related structural variants or 
synthetic analogues thereof) . Thus, while the term 

^oligonucleotide" typically refers to a nucleotide polymer in 
which the nucleotide residues and linkages between them are 
naturally occurring, it will be understood that the term also 

30 includes within its scope various analogues including, but 
not restricted to, peptide nucleic acids (PNAs), 
phosphorami dates , phosphorothioates , methyl phosphonates , 2- 
0-methyl ribonucleic acids, and the like. The exact size of 
the molecule can vary depending on the particular 
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application. An oligonucleotide is typically rather short in 
length, generally from about 10 to 30 nucleotide residues, 
but the term can refer to molecules of any length, although 
the term "polynucleotide" or "nucleic acid" is typically used 
5 for large oligonucleotides. 

By "operably linked" is meant that transcriptional and 
translational regulatory polynucleotides are positioned 
relative to a polypeptide-encoding polynucleotide in such a 
manner that the polynucleotide is transcribed and the 
10 polypeptide is translated. 

By "pharmaceutically-acceptable carrier" is meant a 
solid or liquid filler, diluent or encapsulating substance 
that can be safely used in topical or systemic administration 
to a mammal. 

15 "Polypeptide", "peptide" and "protein" are used 

interchangeably herein to refer to a polymer of amino acid 
residues and to variants and synthetic analogues of the same. 
Thus, these terms apply to amino acid polymers in which one 
or more amino acid residues is a synthetic non-naturally 

20 occurring amino acid, such as a chemical analogue of a 
corresponding naturally occurring amino acid, as well as to 
naturally-occurring amino acid polymers. 

The term "polynucleotide" or "nucleic acid' as used 
herein designates mRNA, RNA, cRNA, cDNA or DNA. The term 
25 typically refers to oligonucleotides greater than 30 
nucleotide residues in length. 

By "primer" is meant an oligonucleotide which, when 
paired with a strand of DNA, is capable of initiating the 
synthesis of a primer extension product in the presence of a 
30 suitable polymerizing agent. The primer is preferably 
single-stranded for maximum efficiency in amplification but 
can alternatively be double-stranded. A primer must be 
sufficiently long to prime the synthesis of extension 
products in the presence of the polymerization agent. The 
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length of the primer depends on many factors, including 
application, temperature to be employed, template reaction 
conditions, other reagents, and source of primers. For 
example, depending on the complexity of the target sequence, 
5 the oligonucleotide primer typically contains 15 to 35 or 
more nucleotide residues, although it can contain fewer 
nucleotide residues. Primers can be large polynucleotides, 
such as from about 200 nucleotide residues to several 
kilobases or more. Primers can be selected to be 

10 ""substantially complementary" to the sequence on the template 
to which it is designed to hybridize and serve as a site for 
the initiation of synthesis. By ""substantially 

complementary" , it is meant that the primer is sufficiently 
complementary to hybridize with a target polynucleotide. 

15 Preferably, the primer contains no mismatches with the 
template to which it is designed to hybridize but this is not 
essential. For example, non-complementary nucleotide 

residues can be attached to the 5' end of the primer, with 
the remainder of the primer sequence being complementary to 

20 the template. Alternatively, non-complementary nucleotide 
residues or a stretch of non-complementary nucleotide 
residues can be interspersed into a primer, provided that the 
primer sequence has sufficient complementarity with the 
sequence of the template to hybridize therewith and thereby 

25 form a template for synthesis of the extension product of the 
primer . 

"Probe" refers to a molecule that binds to a specific 
sequence or sub-sequence or other moiety of another molecule. 
Unless otherwise indicated, the term ""probe" typically refers 
30 to a polynucleotide probe that binds to another 
polynucleotide, often called the w target polynucleotide", 
through complementary base pairing. Probes can bind target 
polynucleotides lacking complete sequence complementarity 
with the probe, depending on the stringency of the 
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hybridization conditions. Probes can be labeled directly or 
indirectly. 

The terms "precursor cell or tissue'' and "progenitor 
cell or tissue" as -used herein refer to a cell or tissue that 
5 can gives rise to a particular cell or tissue in which 
protein expression is to be targeted or in which 
translatio.nal efficiency of a codon is to be determined. 

By "recombinant polypeptide" is meant a polypeptide 
made using recombinant techniques, i.e., through the 
10 expression of a recombinant or synthetic polynucleotide. 

"Stringency" as used herein, refers to the temperature 
and ionic strength conditions, and presence or absence of 
certain organic solvents, during hybridization. The higher 
the stringency, the higher will be the degree of 
15 complementarity between immobilized polynucleotides and the 
labeled polynucleotide. 

"Stringent conditions" refers to temperature and ionic 
conditions under which only polynucleotides having a high 
frequency of complementary bases will hybridize. The 

20 stringency required is nucleotide sequence dependent and 
depends upon the various components present during 
hybridization. Generally, stringent conditions are selected 
to be about 10 to 20°C lower than the thermal melting point 
(TjJ for the specific sequence at a defined ionic strength 

25 and pH . The T m is the temperature (under defined ionic 
strength and pH) at which 50% of a target sequence hybridizes 
to a complementary probe. 

The term "synthetic polynucleotide" as used herein 
refers to a polynucleotide formed in vitro by the 
30 manipulation of a polynucleotide into a form not normally 
found in nature. For example, the synthetic polynucleotide 
can be in the form of an expression vector. Generally, such 
expression vectors include transcriptional and translational 
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regulatory polynucleotide operably linked to the 
polynucleotide . 

The term "synonymous codon" as used herein refers to a 
codon having a different nucleotide sequence than another 
5 codon but encoding the same amino acid as that other codon. 

By "translational efficiency'' is meant the efficiency 
of a cell's protein synthesis machinery to incorporate the 
amino acid encoded by a codon into a nascent polypeptide 
chain. This efficiency can be evidenced, for example, by the 
10 rate at which the cell is able to synthesize the polypeptide 
from an RNA template comprising the codon, or by the amount 
of the polypeptide synthesized from such a template. 

By "vector" is meant a polynucleotide molecule, 
preferably a DNA molecule derived, for example, from a 

15 plasmid, bacteriophage, or plant virus, into which a 
polynucleotide can be inserted or cloned. A vector 

preferably contains one or more unique restriction sites and 
can be capable of autonomous replication in a defined host 
cell including a target cell or tissue or a progenitor cell 

20 or tissue thereof, or be integrable with the genome of the 
defined host such that the cloned sequence is reproducible. 
Accordingly, the vector can be an autonomously replicating 
vector, i.e., a vector that exists as an extrachromosomal 
entity, the replication of which is independent of 

25 chromosomal replication, e.g., a linear or closed circular 
plasmid, an extrachromosomal element, a minichromosome , or an 
artificial chromosome. The vector can contain any means for 
assuring self -replication . Alternatively, the vector can be 
one which, when introduced into the host cell, is integrated 

30 into the genome and replicated together with the 
chromosome ( s) into which it has been integrated. A vector 
system can comprise a single vector or plasmid, two or more 
vectors or plasmids, which together contain the total DNA to 
be introduced into the genome of the host cell, or a 
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transposon. The choice of the vector will typically depend 
on the compatibility of the vector with the host cell into 
which the vector is to be introduced. The vector can also 
include a selection marker such as an antibiotic resistance 
5 gene that can be used for selection of suitable 
transformants. Examples of such resistance genes are known 
to those of skill in the art and include the nptll gene that 
confers resistance to the antibiotics kanamycin and G418 
(Geneticin®) and the hph gene which confers resistance to the 
10 antibiotic hygromycin B. 

2. Method of the invention 

The present invention is based, at least in part, on 
the discovery that different but synonymous stretches of 
identical codons fused respectively in frame with a reporter 

15 polynucleotide can give rise to different levels of reporter 
protein expressed within a given cell type. Not wishing to 
be bound by any particular theory, it is believed that a 
tandem series of identical codons causes a ribosome to pause 
during translation if the iso-tRNA corresponding to the 

20 identical codons is limiting. In this regard, it is known 
that ribosomal pausing leads to a failure to complete a 
nascent polypeptide chain and an uncoupling of transcription 
and translation. Accordingly, the levels of reporter protein 
expressed in the different cells or tissues are sensitive to 

25 the intracellular abundance of the iso-tRNA species 
corresponding to the identical codons and, therefore, provide 
a direct correlation of a cell's or tissue's preference for 
translating a given codon. This means, for example, that if 
the levels of the reporter protein obtained in a cell or 

30 tissue type to which a synthetic construct having a tandem 
series of identical first codons is provided are lower than 
the levels expressed in the same cell or tissue type to which 
a different synthetic construct having a tandem series of 
identical second codons is provided (i.e., wherein the first 
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codons are different from, but synonymous with, the second 
codons) , then it can be deduced that the cell or tissue has a 
higher preference for the second codon relative to the first 
codon with respect to translation. Put another way, the 
second codon has a higher translational efficiency compared 
to the first codon in the cell or tissue type. 

With regard to differential protein expression between 
different cell or tissue types, it will be appreciated that 
if the levels of the reporter protein obtained in a target 
cell or tissue type to which a synthetic construct having a 
tandem series of identical codons is provided are lower than 
the levels expressed in the another cell or tissue type to 
which the same synthetic construct is provided, then it can 
be deduced that the target cell or tissue has a higher 
preference for the codon relative to the other cell or tissue 
with respect to translation. Put another way, the codon has 
a higher translational efficiency in the target cell or 
tissue relative to the other cell or tissue type. 

As used herein, expression of a protein in a tissue 
refers alternatively to expression of the protein within a 
cell of the tissue or production of the protein within a cell 
and export of the protein from the cell to, for example, the 
extracellular matrix of a tissue. 

Suitably, the tandem repeat comprises at least three 
identical codons. Preferably, the tandem repeat comprises 
four identical codons, more preferably five or seven 
identical codons and most preferably six identical codons. 

The tandem repeat can be fused at a location adjacent 
to, or within, the reporter polynucleotide. The location is 
preferably selected such that the tandem repeat interferes 
with translation of at least a detectable portion of the 
reporter protein such that expression of the protein can be 
detected or assessed. Preferably, the tandem repeat is 
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located immediately upstream (translationally ) from the 
reporter polynucleotide. 

It is of course possible that a tandem repeat of 
identical amino acid residues (e.g., an oligo-proline repeat) 
can render the reporter protein unstable. Typically, protein 
instability is detected when expression of the reporter gene 
is not detectable with any choice of isoaccepting codon 
specific for the amino acid corresponding to the tandem 
repeat. The inventors have . found in this regard that protein 
instability can be alleviated by use of at least one spacer 
codon within the tandem repeat of identical codons, wherein 
the spacer codon encodes a neutral amino acid. 

The at least one spacer codon can be placed adjacent 
to, or interposed between, some or all of the identical 
codons corresponding to the tandem repeat. For example, a 
suitable interposition for a penta-repeat of identical codons 
can be selected from the group consisting of: (a) I-S-I-S-I- 
S-I-S-I-S; (b) S-I-S-I-S-I-S-I-S-I; (c) I-S-I-S-I-I-S-I ; (d) 
I-S-I-I-S-I-S-I; (e) I-S-I-S-I-I-I; (f) I-I-S-I-S-I-I ; (g) I- 
I-I-S-I-S-I; (h) I-S-I-I-S-I-I; (i) I-I-S-I-I-S-I; (j) I-S-I- 
I _ I _s_i ; (k) i-s-i-i-i-I; (l) i-i-s-i-i-I; (m) i-i-i-s-i-I; 
and (n) I-I-I-I-S-I, wherein I corresponds to an identical 
codon of a tandem repeat and S corresponds to a spacer codon. 

Preferably, a spacer codon is efficiently translated in 
the cell or tissue type relative to other synonymous codons. 
This is important so that translation of the spacer codon is 
not rate limiting. The neutral amino acid includes, but is 
not restricted to, alanine and glycine. 

The reporter polynucleotide can encode any suitable 
protein for which expression can be detected directly or 
indirectly such as by suitable assay. Suitable reporter 
polynucleotides include, but are not restricted to, 
polynucleotides encoding (3-galactosidase , firefly luciferase, 
alkaline phosphatase, chloramphenicol acetyltransf erase 
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(CAT) / ^-glucuronidase (GUS) , herbicide resistance genes such 
as the bialophos resistance (BAR) gene that confers 
resistance to the herbicide RASTA, and green fluorescent 
protein (GFP) . Assays for the activities associated with 
such proteins are known by those of skill in the art. 
Preferably, the reporter polynucleotide encodes GFP - 

Persons of skill in the art will appreciate that 
reporter polynucleotides need not correspond to a full-length 
gene encoding a particular reporter protein- In this regard, 
the invention also contemplates reporter polynucleotide sub- 
sequences encoding desired portions of a parent reporter 
protein, wherein an activity or function of the parent 
protein is retained in said portions. A polynucleotide sub- 
sequence encodes a domain of the reporter protein having an 
15 activity associated therewith and preferably encodes at least 
10, 20, 50, 100, 150, or 500 contiguous amino acid residues 
of the reporter protein. 

The instant method is applicable to any suitable cell 
or tissue type and, hence, is not restricted to application 
20 to mammalian cells/tissues. Accordingly, the cell or tissue 
type can be of any animal or plant origin. The cell or 
tissue type can be of any suitable lineage. For example, a 
suitable cell can include a eukaryotic cell, and preferably a 
cell or cell line capable of being grown in vitro. Suitable 
25 cell lines can include, for example, CV-1 cells, COS cells, 
yeast or spodoptera cells. The invention also contemplates 
cells that can be prokaryotic in origin. 

Suitable methods for isolating particular cells or 
tissues are known to those of skill in the art. For example, 
30 one can take advantage of one or more particular 
characteristics of a cell or tissue to specifically isolate 
the cell or tissue from a heterogeneous population. Such 
characteristics include, but are not limited to, anatomical 
location of a tissue, cell density, cell size, cell 
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morphology, cellular metabolic activity, cell uptake of ions 
such as Ca 2+ , K + , and H + ions, cell uptake of compounds such 
as stains, markers expressed on the cell surface, protein 
fluorescence, and membrane potential. Suitable methods that 
can be used in this regard include surgical removal of 
tissue, flow cytometry techniques such as fluorescence- 
activated cell sorting (FACS) , immunoaf f inity separation 
{e.g., magnetic bead separation such as Dynabead™ 
separation), density separation (e.g., metrizamide, Percoll™, 
or Ficoll™ gradient centrif ugat ion) , and cell-type specific 
density separation. 

In an alternate embodiment, progenitor cells or tissues 
can be used for initially introducing the synthetic 
construct. Any suitable progenitor cell or tissue can be 
used which gives rise to a particular cell or tissue of 
interest for which codon preference is to be ascertained. 
For example, a suitable progenitor cell can comprise an 
undifferentiated cell. In the case of a plant, a suitable 
progenitor cell and tissue can include a meristematic cell 
and a callus tissue, respectively. 

In another embodiment, the synthetic construct can be 
introduced first into an organism or part thereof before 
subsequent expression of the construct in a particular cell 
or tissue type. Any suitable organism is contemplated by the 
invention including unicellular and as multi-cellular 
organisms. Exemplary multi-cellular organisms include plants 
and animals such as mammals ( e. g. , humans). 

The invention further provides a synthetic construct 
comprising a reporter polynucleotide fused in frame with a 
tandem repeat of (e.g., 2, 3, 4, 5, 6, or 7 or more) 
identical codons, wherein said reporter polynucleotide 
encodes a reporter protein, and wherein said synthetic 
construct is operably linked to one or more regulatory 
polynucleotides . 
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The construction of the synthetic construct can be 
effected by any suitable technique. For example, in vitro 
mutagenesis methods can be employed, which are known to those 
of skill in the art. Suitable mutagenesis methods are 
described for example in the relevant sections of Ausubel, et 
al. (supra) and of Sambrook, et al. r (supra) which are 
incorporated herein by reference. Alternatively, suitable 
methods for altering DNA are set forth, for example, in U.S. 
Patent Nos . 4,184, 917, 4, 321,365 and 4, 351, 901, which are 
incorporated herein by reference. Instead of in vitro 
mutagenesis, the synthetic construct can be synthesized de 
novo using readily available machinery. Sequential synthesis 
of DNA is described, for example, in U.S. Patent No 
4,293,652, which is incorporated herein by reference. 
However, it should be noted that the present invention is not 
dependent on, and not directed to, any one particular 
technique for constructing the synthetic construct. 

Regulatory polynucleotides which can be utilized to 
regulate expression of the synthetic construct include, but 
are not limited to, a promoter, an enhancer, and a 
transcriptional terminator. Such regulatory polynucleotides 
are known to those of skill in the art. The construct 
preferably comprises at least one promoter. Suitable 
promoters that can be utilized to induce expression of the 
polynucleotides of the invention include constitutive 
promoters and inducible promoters . 

The step of introducing the synthetic construct into a 
particular cell or tissue type, or into a progenitor cell or 
tissue thereof, or into an organism or part thereof for 
subsequent introduction into a particular cell or tissue will 
differ depending on the intended use and or species, and may 
involve lipofection, electroporation, micro-projectile 
bombardment infection by Agrobacterium tumefaciens or A 
rhizogenes, or protoplast fusion. Such methods are known to 
those skilled in the art. 
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Alternatively, the step of introduction may involve 
non-viral and viral vectors, cationic liposomes, retroviruses 
and adenoviruses such as, for example, described in Mulligan, 
R.C., (1993 Science 260 926-932) which is incorporated herein 
5 by reference. Such methods may include: 

A. Local application of the synthetic nucleic acid sequence 
by injection (Wolff et al., 1990, Science 247 1465-1468, 
which is incorporated herein by reference) , surgical 
implantation, instillation or any other means. This 

10 method may also be used in combination with local 

application by injection, surgical implantation, 
instillation or any other means, of cells responsive to 
the reporter protein encoded by the synthetic construct. 
This method may also be used in combination with local 

15 application by injection, surgical implantation, 

instillation or any other means, of another factor or 
factors required for the activity of said reporter 
protein . 

B. General systemic delivery by injection of DNA, (Calabretta 
20 et al., 1993, Cancer Treat. Rev. 19 169-179, which is 

incorporated herein by reference) , or RNA, alone or in 
combination with liposomes (Zhu et al., 1993, Science 261 
209-212, which is incorporated herein by reference), viral 
capsids or nanoparticles (Bertling et al., 1991, Biotech. 

25 Appl. Biochem. 13 390-405, which is incorporated herein by 

reference) or any other mediator of delivery. Improved 
targeting might be achieved by linking the synthetic 
construct to a targeting molecule (the so-called "magic 
bullet" approach employing for example, an antibody) , or 

30 by local application by injection, surgical implantation 

or any other means, of another factor or factors required 
for the activity of the protein produced from said 
synthetic construct, or of cells responsive to said 
reporter protein. 
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C. Injection or implantation or delivery by any means, of 
cells that have been modified ex vivo by transfection (for 
example, in the presence of calcium phosphate: Chen et 
al., 1987, Mole. Cell Biochem. 7 2745-2752, or of cationic 
lipids and polyamines: Rose et al., 1991, BioTech. 10 520- 
525, which articles are incorporated herein by reference), 
infection, injection, electroporation (Shigekawa et al., 
1988, BioTech. 6 742-751, which is incorporated herein by 
reference) or any other way so as to increase the 
expression of said synthetic construct in those cells. 
The modification may be mediated by plasmid, 
bacteriophage, cosmid, viral (such as adenoviral or 
retroviral; Mulligan, 1993, Science 260 926-932/ Miller, 
1992, Nature 357 455-460; Salmons et al., 1993, Hum. Gen. 
Ther. 4 129-141, which articles are incorporated herein by 
reference) or other vectors, or other agents of 
modification such as liposomes (Zhu et al., 1993, Science 
261 209-212, which is incorporated herein by reference) , 
viral capsids or nanoparticles (Bertling et al., 1991, 
Biotech. Appl. Biochem. 13 390-405, which is incorporated 
herein by reference), or any other mediator of 
modification. The use of cells as a delivery vehicle for 
genes or gene products has been described by Barr et al., 
1991, Science 254 1507-1512 and by Dhawan et al., 1991, 
Science 254 1509-1512, which articles are incorporated 
herein by reference. Treated cells may be delivered in 
combination with any nutrient, growth factor, matrix or 
other agent that will promote their survival in the 
treated subject. 

Advantageously, the translational efficiencies of 
different codons may be determined by comparing expression of 
the reporter protein in a given cell or tissue type or 
between different cell or tissue types. One of ordinary 
skill in the art will thereby be able to determine a "codon 
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preference table" for one or more cells or tissues. 
Comparison of codon preference tables relating to different 
cell or tissue types may be used to identify codons for 
tailoring a synthetic polynucleotide to target expression of 
a protein to a particular cell or tissue, as described 
hereinafter. Comparison of codons within a codon preference 
table for a particular cell or tissue type can be used to 
identify codons for tailoring a synthetic polynucleotide to 
express a protein at higher or lower levels in that cell or 
tissue type than a parent polynucleotide, as described 
hereinafter . 

The invention further contemplates cells or tissues 
containing therein the synthetic construct of the invention, 
or alternatively, cells or tissues produced from the method 
of the invention. 

3. Synthetic polynucleotides for targeting protein expression 
to a particular cell or tissue 

The invention also provides an improved method of 
constructing a synthetic polynucleotide from which a protein 
is selectively expressible in a target cell of an organism, 
relative to another cell of the organism. This method is 
based in part on the method disclosed in copending 
International application PCT/AU98/00530 (the entire contents 
of which are hereby incorporated by reference) in which a 
first codon of a parent polynucleotide is replaced with a 
synonymous codon which has a higher translat ional efficiency 
in said target cell than in said other cell. The improved 
method of the invention is characterized by selecting the 
first and synonymous codons by comparing translat ional 
efficiencies of individual codons in said target cell 
relative to said other cell using the method broadly 
described is Section 2. 
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3.2. Selection of synonymous and first codons 

The present method preferably includes the step of 
selecting the codons such that the synonymous codon has a 
higher translational efficiency in said target cell or tissue 
5 ("cell or tissue" is sometimes referred to herein as 
xx cell/tissue" ) relative to said one or more other cells or 
tissues . 

A method for determining translational efficiencies of 
different codons in and between different cells or tissues is 

10 described in detail in Section 2. The translational 

efficiencies so determined can be used to identify which 
isocoding triplets are differentially translated between the 
different cells or tissues. In a typical scenario, there 
will be: (A) codons with higher translational efficiencies in 

15 a target cell/tissue relative to one or more other 
cells/tissues; (B) codons with higher translational 
efficiencies in the one or more other cells/tissues relative 
to the target cell/tissue; and (C) codons with about the same 
translational efficiencies in the target cell/tissue relative 

20 to the one or more other cells/tissues. Synonymous codons 
are selected such that they correspond to (A) codons. 
Preferably, a synonymous codon is selected such that it has 
the largest difference in translational efficiency in the 
target cell or tissue relative to the existing codon 

25 (sometimes referred to as a "first codon" ) that it replaces. 

Existing codons in a parent polynucleotide are preferably 
selected such that they do not have the same translational 
bias as the synonymous codons with respect to the target 
cell/tissue and the one or more other cell/tissue (i.e., 

30 existing codons should preferably not correspond to (A) 
codons) . However, existing codons can have similar 

translational efficiencies in each of the target cell/tissue 
and the one or more other cells/tissues (i.e., existing 
codons can correspond to (C) codons. They can also have a 

35 translational bias opposite to that of the synonymous codons 
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(i.e., existing codons can, and preferably do, correspond to 
(B) codons) . 

Suitably, a synonymous codon has a translational 
efficiency in the target cell/tissue that is at least 110%, 
preferably at least 200%, more preferably at least 500%, and 
still more preferably at least 1000%, of that in the other 
cell (s) /tissue (s) . In the case of two or more synonymous 
codons having similar translational efficiencies in the 
target cell/tissue relative to the other cell ( s) /tissue ( s ) , 
it will be appreciated that any one of these codons can be 
used to replace the existing codon. 

It is preferable but not necessary to replace all the 
existing codons of the parent polynucleotide with synonymous 
codons having higher translational efficiencies in the target 
cell/tissue compared to the other cells/tissues. Increased 
expression can be accomplished even with partial replacement. 
Suitably, the replacement step affects 5%, 10%, 15%, 20%, 
25%, 30%, more preferably 35%, 40%, 50%, 60%, 70% or more of 
the existing codons of the parent polynucleotide. 

The difference in level of protein expressed in the 
target cell/tissue from a synthetic polynucleotide relative 
to that expressed in the other cell ( s) /tissue ( s ) depends on 
the percentage of existing codons replaced by synonymous 
codons, and the difference in translational efficiencies of 
the synonymous codons in the target cell/tissue relative to 
the other cell (s) /tissue (s) . Put another way, the fewer such 
replacements, and/or the smaller the difference in 
translational efficiencies of the synonymous between the 
different cells/tissues, the smaller the difference in 
protein expression between the target cell/tissue and the 
other cell (s) /tissue (s) will be. Conversely, the more such 
replacements, and/or the greater the difference in 
translational efficiencies of the synonymous codons between 
the different cells/tissues, the greater the difference in 
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protein expression between the target cell/tissue and the 
other cell (s) /tissue (s) will be. The inventors have found in 
this respect that a protein can be expressed from a synthetic 
polynucleotide in a target cell/tissue at levels greater than 
5 10,000-fold over those expressed in another cell/tissue. 

In a preferred embodiment, the synonymous codon is a 
codon which has a higher translational efficiency in the 
target cell or tissue relative to a precursor cell or tissue 
of the target cell or tissue. 

10 In an alternate embodiment, the synonymous codon is a 

codon which has a higher translational efficiency in the 
target cell or tissue relative to a cell or tissue derived 
from said target cell or tissue. 

The two codons can be selected by measuring 
15 translational efficiencies of different codons in the target 
cell or tissue relative to the one or more other cells or 
tissues and identifying the at least one existing codon and 
the synonymous codon based on this measurement . 

Suitably, the synonymous codon corresponds to a 
20 reporter construct from which the reporter protein is 
expressed in said target cell at a level that is at least 
110%, preferably at least 200%, more preferably at least 
500%, and most preferably at least 1000%, of that expressed 
from the said reporter construct in said other cell. 

25 3.2. Construction of synthetic polynucleotides 

The step of replacing a synonymous codon for said first 
codon in a parent polynucleotide may be effected by any 
suitable technique. For example, in vitro mutagenesis 

methods may be employed as for example discussed in Section 
30 2. 

It is not necessary to replace all the first codons of 
the parent polynucleotide with synonymous codons each 
corresponding to a codon that has a higher translational 
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efficiency in the target cell relative to said other cell. 

Increased expression may be accomplished even with partial 

replacement- Preferably, the replacing step affects 5%, 10%, 

15%, 20%, 25%, 30%, more preferably 35%, 40%, 50%, 60%, 70% 

5 or more of the existing codons of the parent nucleic acid 
sequence . 

The parent polynucleotide is preferably a natural gene. 

The parent polynucleotide may be obtained from a plant 
or an animal. Alternatively, the parent polynucleotide may 

10 be obtained from any other eukaryotic organism or a 
prokaryotic organism. In a preferred embodiment, the parent 
polynucleotide is obtained from a pathogenic organism. In 
such a case, a natural host of the pathogenic organism is 
preferably a plant or animal. For example, the pathogenic 

15 organism may be a yeast, bacterium or virus. However, it 
will be understood that the parent polynucleotide need not be 
obtained from the organism in which a protein is to be 
expressed but may be obtained from any suitable source such 
as from another eukaryotic or prokaryotic organism. 

20 Suitable proteins which may be used for selective 

expression in accordance with the invention include, but are 
not limited to the cystic fibrosis transmembrane conductance 
regulator (CFTR) protein, and adenosine deaminase (ADA) . In 
the case of CFTR, a parent nucleic acid sequence encoding the 

25 CFTR protein which may be utilized to produce the synthetic 
nucleic acid sequence is described, for example, in Riordan 
et al (1989, Science 245 1066-1073), and in the GenBank 
database under Accession No. HUMCFTRM, which are incorporated 
herein by reference. 

30 Regulatory polynucleotides which may be utilized to 

regulate expression of the synthetic polynucleotide include, 
but are not limited to, a promoter, an enhancer, and a 
transcriptional terminator. Such regulatory polynucleotides 
are known to those of skill in the art. The construct 
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preferably comprises at least one promoter. Suitable 
promoters that can be utilized to induce expression of the 
synthetic polynucleotides of the invention include 
constitutive promoters and inducible promoters. 
5 Synthetic polynucleotides according to the invention 

may be operably linked to one or more regulatory sequences in 
the form of an expression vector. 

The invention also contemplates synthetic 

polynucleotides encoding one or more desired portions - of the 
10 protein to be expressed. A polynucleotide encodes a domain 
of the protein having a function associated therewith, or 
which is otherwise detectable, and preferably encodes at 
least 10, 20, 50, 100, 150, or 500 contiguous amino acid 
residues of the protein. 

15 4. Synthetic polynucleotides for enhanced protein expression 

in a particular cell or tissue 

In contrast to differential protein expression between 
different cells/tissues, it will be appreciated that a 
synthetic polynucleotide may be tailored with synonymous 

20 codons such that expression of a protein in a target cell is 
enhanced. In this regard, the difference in level of protein 
expressed in the target cell/tissue from a synthetic 
polynucleotide relative to that expressed from a parent 
polynucleotide depends on the percentage of existing codons 

25 replaced by synonymous codons, and the difference in 
translational efficiencies between the existing codons and 
the synonymous codons in the target cell/tissue. Put another 
way, the fewer such replacements, and/or the smaller the 
difference in translational efficiencies between the 

30 synonymous and existing codons, the smaller the difference in 
protein expression between the synthetic polynucleotide and 
parent polynucleotide will be. Conversely, the more such 
replacements, and/or the greater the difference in 
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translational efficiencies between the synonymous and 
existing codons, the greater the difference in protein 
expression between the synthetic polynucleotide and parent 
polynucleotide will be. The inventors have found in this 
respect that a protein can be expressed from a synthetic 
polynucleotide in a target cell/tissue at levels greater than 
10,000-fold than from a parent polynucleotide. 

Preferably, the at least one existing codon and the 
synonymous codon are selected such that said protein is 
expressed from said synthetic polynucleotide in said target 
cell or tissue at a level which is at least 110%, preferably 
at least 200%, more preferably at least 500%, and most 
preferably at least 1000%, of that expressed from said parent 
polynucleotide in said target cell or tissue. 



The invention is further described with reference to 
the following non-limiting examples. 

EXAMPLE 1 

Construction of expression vectors for determining 
relative codon preferences in mammalian cells. 

Synthetic gfp genes were constructed in which a single 
artificial start codon (ATG) followed by a stretch of five 
identical codons is fused in frame immediately upstream of a 
gfp coding sequence. A reverse oligonucleotide primer (SEQ 
ID NO: 185; sequence complementary to the termination codon 
for GFP, is underlined) , and a suite of forward 
oligonucleotide primers (SEQ ID NO: 126 through 184; the 
first codon of GFP, is underlined) were synthesized and used 
for PCR amplification of a humanized gfp gene (SEQ ID NO: 124) 
(GIBCO) as template with Tag DNA polymerase (Amplification 
parameters: 95°C/30 sec; 52°C/30 sec; 72°C/1 min; 30 cycles) . 
The amplified fragments have nucleic acid sequences and 
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deduced amino acid sequences as shown in SEQ ID NO:l through 
124 . 

In summary, the synthetic fragments contain an 
artificial start codon followed by a tandem repeat of five 
identical codons specific for a given iso-tRNA species. The 
tandem repeat immediately precedes the second codon of the 
gfp gene. The synthetic fragments by SEQ ID NO, and encoded 
tandem repeat, are presented in the TABLE 1. 

The amplified fragments were cloned between the £coRI 
and Kpnl sites of the mammalian expression vector pCDNA3 
containing SV4 0 ori (Invitrogen) and the CMV promoter. 

Transfection of COS-1 cells 

COS-1 cells were grown continuously in DMEM media 
supplemented with 10% fetal calf serum (FCS) , glutamine, 
penicillin and streptomycin. Cells were passaged from a 150 
cm 2 flask into multiple 25 cm 2 flasks. Cells were 

transfected using a QIAGEN Effectene" transfection kit (and 
the manufacturer's instructions, incorporated herein by 
reference) when confluency of the cells was between 60-80%. 
Briefly, 1 ug of plasmid DNA was diluted into 10 uL of 
filtered TE buffer and 140 \iL of QIAGEN™ Buffer EC. Eight 
microliters of QIAGEN™ Enhancer was added followed by 
vortexing and incubation at room temperature for 2-5 min. 
QIAGEN™ Effectene (10 uL) was added followed by vortexing 
for 10 seconds and a further incubation at room temperature 
for 10 min. The cells were washed once in lx PBS followed by 
re-suspension in fresh media (1 mL) . After 48 hrs, cells 
were harvested and washed in lx PBA (phosphate buffered 
saline plus azide) . Cells adhering to the flask were removed 
by scraping with a cell scraper. Cells were then filtered 
through a 70 urn filter before addition of 300 ui of 2% 
paraformaldehyde and 300 uL of lOx FCS. Cells were kept on 
ice in the dark until FACS analysis. 
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Synthetic gfp mRNA expression of transfected cells was 
tested by reverse transcriptase PCR. GFP protein expression 
was analyzed by confocal microscopy and flow cytometry. 

Confocal microscopy 
5 Transfected COS-1 cells were examined using a Bio-Rad 

MRC-600 laser-scanning confocal microscope equipped with a 
krypton-argon laser and filter sets suitable for the 
detection of fluorescein arid Texas red dyes (Bio-Rad KlyK2), 
and a Nikon 603 PlanApo™ numerical aperture 1.2 water- 
10 immersion objective. Dual-channel confocal images and video 
montages of the transfected cells can be suitably composed 
using ADOBE PhotoShop™. 

Flow cytometry 

Transfected COS-1 cells were analyzed with a Becton 
15 Dickinson™ Flow cytometer Elite II. Omega Filters™ allowed 
detection of green fluorescence emission (EMI510/20 
collects light from 490-530 nm) and yellow fluorescence 
emission (EM2 550/30 - collects light form 525-580 nm) from 
the transfected cells. 

20 Results 

A series of 64 reporter constructs (see TABLE 1) was 
made and validated, in which the gfp gene is preceded in 
frame by a tandem repeat of 5 identical codons . Together, 
the series 'covers the entire set of isoaccepting codon 

25 triplets. 

The series was transfected into a single cell line, and 
expression levels measured by flow cytometry (see TABLE 2) . 
Overall, the expression level of the reporter gene constructs 
in the cell line varied over a range of 20-fold, according to 

30 the triplet used in the reporter construct. Repeated 
determinations on the same construct showed excellent inter- 
assay reproducibility (r 2 = 0.9). Variation in expression 
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levels across the isoaccepting codons for a single amino acid 
ranged from 1.4-fold for valine to 13-fold for threonine, 
with a median of about 4-fold. Variation in expression 
between amino acids was of the same order of magnitude. The 
5 order of magnitude of the effect is defined as an average of 

4 fold per amino acid if 5 copies are incorporated, 
compatible with an extreme in range of expression levels of 
up to (1 . 6) 200 = 10 86 over an average 200-amino acid residues 
protein. This figure is derived as: 

10 [1 + ((4-1) ( range of reporter construct expression)/ 

. j_ \ \ -t 200 (no of amino acid 

5 (no of triplets in the reporter construct))] 

residues in the protein) 

and is more than sufficient to explain the observed 
differences in expression of mammalian genes according to 

15 codon usage. 

The results presented in TABLE 2 also show that various 
codons in the undifferentiated epithelial cells (COS-1) have 
translational efficiencies at least two-fold higher or two- 
fold lower relative to those of their corresponding 

20 synonymous codons. Representative codons having at least a 
two-fold higher translational efficiency relative to at least 
one of their corresponding synonymous codons include aga 
(Arg) , egg (Arg) , tgc (Cys) , gga (Gly) , ggc (Gly) , ccg (Pro), 
cga (Pro), aca (Thr) , acg (Thr) , and act (Thr) . Thus, these 

25 codons appear to be preferred for translation in the 
undifferentiated epithelial cells. By contrast, 

representative codons having at least a two-fold lower 
translational efficiency relative to at least one of their 
corresponding synonymous codons include agg (Arg) , tgt (Cys) , 

30 ggg (Gly), ggt (Gly), ccc (Pro), cct (Pro), and acc (Thr). 

These latter codons would therefore appear to be less 
preferred for translation in the undifferentiated epithelial 
cells. Accordingly, if higher protein expression is required 
within undifferentiated epithelial cells such as COS-1 cells, 
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the preferred codons should be used to replace any existing 
codons of a parent polynucleotide encoding the protein that 
correspond to the less preferred codons. In this respect, a 
codon substitution algorithm for increasing protein 
5 expression in non-differentiated epithelial cells is 
presented in TABLE 3. However, if lower protein expression 
is required in non-differentiated epithelial cells, the less 
preferred codons should be used to replace any existing 
codons of the parent polynucleotide that correspond to the 
10 preferred codons. 



The disclosure of every patent, patent application, and 
publication cited herein is hereby incorporated by reference 
in its entirety. 

15 The present invention has been described in terms of 

particular embodiments found or proposed by the present 
inventors to comprise preferred modes for the practice of the 
invention. Those of skill in the art will appreciate that, 
in light of the present disclosure, numerous modifications 

20 and changes can be made in the particular embodiments 
exemplified without departing from the scope of the 
invention. All such modifications are intended to be 
included within the scope of the appended claims. 



o 



WO 00/4221 5 PCT/AUOO/00008 

- 32 - 



TABLE 1 

Synthetic gfp constructs are tabulated by SEQ ID NO and 
by the codon corresponding to the tandem repeat of five 
identical codons immediately upstream of the gfp gene. 

5 TABLE 2 

Mean fluorescence intensities of up to four different 
samples of transiently transfected COS-1 cells are shown 
(Green mean 1-4). Synthetic gfp constructs are tabulated by 
SEQ ID NO and by the codon corresponding to the tandem repeat 
10 immediately upstream of the gfp gene. 

TABLE 3 

Input codons and output codons represent, respectively, 
synonymous codons and existing (i.e., "first") codons 
according to the invention. Change means an actual change of 
15 a codon. 
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TABLES 

TABLE 1 



Synthetic fragments and tandem repeats encoded thereby. 



SEQ ID NO 


Tandem repeat 


SEQ ID NO 


Tandem repeat 


1 


Ala 


(GCA) 


X 


5 


O D 


Leu 


(CTT) 


x 5 


3 


Ala 


(GCC) 


X 


5 


O / 


Leu 


(TTA) 


x 5 


5 


Ala 


(GCG) 


X 


5 


£Q 

O j 


Leu 


(TTG) 


x 5 


7 


Ala 


(GCT) 


X 


5 


/ X 


Lys 


(AAA) 


x 5 


9 


Arg 


(AGA) 


X 


5 




Lys 


(AAG) 


x 5 


11 


Arg 


(AGG) 


X 


5 




Phe 


(TTT) 


x 5 


1 3 


Arg 


(CGA) 


X 


5 


7 7 


Phe 


(TTC) 


x 5 


1 b 


Arg 


(CGC) 


X 


5 


7 9 


Pro 


(CCC) 


x 5 


T "7 

X / 


Arg 


(CGG) 


X 


5 


81 


Pro 


(CCG) 


x 5 


-I Q 


Arg 


(CGT) 


X 


5 


83 


Pro 


(CCT) 


x 5 


O 1 
Z X 


Asn 


(AAC) 


X 


5 


85 


Pro 


(CGA) 


x 5 




Asn 


(AAT) 


X 


5 


87 


Ser 


(AGC) 


x 5 


25 


nop 






5 


89 


Ser 


(AGT) 


x 5 


27 


Asp 


(GAT) 


X 


5 


91 


Ser 


(TCA) 


x 5 


29 


Cys 


(TGC) 


X 


5 


93 


Ser 


(TCC) 


x 5 


31 


Cys 


(TGT) 


X 


5 


95 


Ser 


(TCG) 


x 5 


33 


Gin 


(CAA) 


X 


5 


97 


Ser 


(TCT) 


x 5 


35 


Gin 


(CAG) 


X 


5 


99 


Thr 


(ACA) 


x 5 


37 


Gly 


(GAA) 


X 


5 


101 


Thr 


(ACC) 


x 5 


39 


Gly 


(GAG) 


X 


5 


103 


Thr 


(ACG) 


x 5 


41 


Gly 


(GGA) 


X 


5 


105 


Thr 


(ACT) 


x 5 
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SEQ ID NO 


Tandem repeat 


SEQ ID NO 


Tandem repeat 


43 


Gly 


(GGC) 


X 


5 


107 


Trp 


(TGG) 


x 5 


45 


Gly 


(GGG) 


X 


5 


109 


Tyr 


(TAT) 


x 5 


47 


Gly 


(GGT) 


X 


5 


111 


Tyr 


(TAC) 


x 5 


49 


His 


(CAC) 


X 


5 


113 


Val 


(GTA) 


x 5 


51 


His 


(CAT) 


X 


5 


115 


Val 


(GTC) 


x 5 


53 


He 


(ATA) 


X 


5 


117 


Val 


(GTG) 


x 5 


55 


He 


(ATC) 


X 


5 


119 


Val 


(GTT) 


x 5 


57 


He 


(ATT) 


X 


5 


121 


Stop 


(TAA) 


x 5 


59 


Leu 


(CTA) 


X 


5 


122 


Stop 


(TAG) 


x 5 


61 


Leu 


(CTC) 


X 


5 


123 


Stop 


(TGA) 


x 5 


63 


Leu 


(CTG) 


X 


5 


124 


control 
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TABLE 2 



GFP protein expression in transiently transfected COS-1 



SEQ ID 


Codon 


r n]\i 7A i 

l Uviri J 


Croon 




Green 


Green 


Average 


NO 




{ jjg/m.Li ; 


mean 
± 


1 lit; d 1 1 
O 

z 


mean 

3 


mean 
4 




1 


Ala (GCA) 


1 . 07 


• 45.70 


54 . 4U 






50 . 05 


3 


Ala (GCC) 


T 1 A 
1.10 


4 j . / U 


CL A A A 

jU . uu 






46.85 


5 


Ala (GCG) 


0 . 03 


28 . 50 


42.40 






J J . 4 J 


7 ! 


Ala (GCT) 


0 .56 


11 . 60 


48.30 






9 Q QS 


9 


Arg (AGA) 


0 . 90 


29.00 


33 . 00 






31 00 


11 


Arg (AGG) 


0 . 34 


7 . 35 


2 . 88 






o 12 


13 


Arg (CGA) 


1 . 00 


18.30 


1/1 OA 

14 . zU 






16.25 


15 


Arg (CGC) 


0.86 


14 . 60 


n x~ A A 

lb, UU 






15 . 30 


17 


Arg (CGG) 


1 . 00 


22.50 


oa a c\ 
zU . bU 






21 . 55 


19 


Arg (CGT) 


0.68 


o 1 o a 
Z 1 . / U 


o o on 

jjZ . ZU 






26.95 


21 


Asn (AAC) 


n no 

U - UZ 












23 


Asn (AAT) 


n oo 
U . j> o 




q 99 






18 .26 


25 


Asp (GAC) 


U.4D 










21 . 35 


O "1 

27 


Asp v LjAI ; 




i a so 


1 8 90 






16. 70 


O A 


cys ( i 


U . DO 


9 1 QO 


16.10 






19.00 


O "1 

31 


cys ( l Lii ; 


1 1/1 


o Q S 


5.89 






5. 92 


33 


Oin 


u . z o 


9 SO 


43 50 






35 . 00 


Jb 


Ljj_n (CAtj/ 


1 Q ft 




48 . 60 






46. 65 


37 


Glu (GAA) 


0. 60 


10. 30 


22.70 






16.50 


39 


Glu (GAG) 


0.43 


3.86 










41 


Gly (GGA) 


0.33 


28.80 


36 . 30 






32. 55 


43 


Gly (GGC) 


1.62 


17. 80 


28 . 10 






22 . 95 


45 


Gly (GGG) 


1.15 


6.43 


4 . 96 






5 .70 


47 


Gly (GGT) 


1.39 


7 . 12 


4 .02 






5. 57 


49 


His (CAC) 


1.62 


29. 90 


39 .70 






34 . 80 


51 


His (CAT) 


1.69 


43.40 


37 .20 






40 . 30 


1 53 


He (ATA) 


0.69 


2.76 


3 . 98 






3.37 
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SEQ ID 


Codon 


[DNA] 


Green 


Green 


Green 


Green 


Average 


NO 






(pg/mL) 


mean 
1 


mean 
2 


mean 
3 


mean 
4 




55 


He 


(ATC) 


1. 52 


4 . 12 


2.83 






3 . 48 


57 


He 


(ATT) 


1.77 


3. 19 


3. 16 






3. 18 


59 


Leu 


(CTA) 


0.10 


15. 00 


3.01 


5.26 


2.44 


6.43 


61 


Leu 


(CTC) 


1.74 


2.70 


2. 92 


2. 56 




2 . 73 


63 


Leu 


(CTG) 


0.41 


2. 80 


7 . 51 


2. 63 




4 . 31 


65 


Leu 


(CTT) 


1.43 


' 3. 17 


3. 56 


2.70 




3.14 


67 


LeU 


(TTA) 


0. 62 


3. 85 


3. 91 


2. 66 




3.47 


69 


Leu 


(TTG) 


0.70 


2.87 


4 . 63 


2.85 




' 3.45 


71 


Lys 


(AAA) 


0.10 


11. 90 


8.24 






10 . 07 


73 


Lys 


(AAG) 


0.56 


19.20 


16. 00 






17 . 60 


75 


Phe 


(TTT) 


2.28 


2. 67 










77 


Phe 


(TTC) 


1. 65 


4.35 










79 


Pro 


(CCC) 


0.40 


12. 00 


8.95 






10 .48 


81 


Pro 


(CCG) 


0.13 


17. 40 


25. 40 






21. 40 


83 


Pro 


(CCT) 


0.40 


10. 60 


9.89 






10.25 


85 


Pro 


(CGA) 


0.17 


27 . 20 


34 . 80 






31. 00 


87 


Ser 


(AGC) 


0.03 


62. 40 










89 


Ser 


(AGT) 


0.81 


23. 10 










91 


Ser 


(TCA) 


0.08 


30.70 


37 . 20 






33. 95 


93 


Ser 


(TCC) 


1. 68 


32. 90 










95 


Ser 


(TCG) 


1. 58 


60. 00 










97 


Ser 


(TCT) 


0. 62 


26. 80 


40.70 






33.75 


99 


Thr 


(ACA) 


1.70 


37. 80 


39.90 






38 . 85 


101 


Thr 


(ACC) 


7.69 


3. 48 


2 .75 






3 . 12 


103 


Thr 


(ACG) 


1.06 


36. 10 


44 . 10 






40 . 10 


105 


Thr 


(ACT) 


1.42 


38. 80 


42 . 60 






40.70 


107 


Trp 




1.19 


5. 21 


4.29 






4 . 75 




Tyr 


(TAT) 


n 09 
\j * \j 












in 


Tyr 


(TAC) 


1 . 07 


12. 00 


15 . 00 






13. 50 


113 


Val 


(GTA) 


0.16 


10. 50 


3.81 






7 . 16 


115 


Val 


(GTC) 


0. 66 


15.20 


4 .55 


3. 65 


5. 06 


7 . 12 


117 


Val 


(GTG) 


0. 10 


9. 17 


4.29 


7. 03 


2; 36 


5.71 
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SEQ ID 


Codon 


[DNA] 


Green 


Green 


Green 


Green 


n v ci- ay 


NO 




(yg/mL) 


mean 
1 


mean 
2 


mean 
3 


mean 
4 




119 
121 


Val (GTT) 
stop 


0 . 49 
1.88 


14.10 
39.40 


2 . 63 
35 . 30 


3 . 70 


2.49 


ex. 1 "5 
*3 "7 It: 


122 


(TAA) 
stop 
(TAG) 


2.86 


2.88 


3.28 






j . Uo 


123 


stop 


0. 02 












124 


( 1 taA) 




' 9.34 


61. 60 


30. 40 


55. 00 


39. 09 


GFP 
















alone 
control 






; 2.33 


2.21 


2. 16 


2. 00 


2. 18 
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TABLE 3 

Substitution algorithm used for high level expression in non 
differentiated epithelial cells 



Input Codon 


Output Codon 


Amino Acid 


Change 


AAA 


AAG 


LYS | 


Yes 


AAC 


AAC 


ASN 


No 


AAG 


AAG 


LYS 


No 


AAT 


AAC 


ASN 


Yes 


AAU 


AAC 


ASN 


Yes 


ACA 


ACC 


THR 


Yes 


ACC 


ACC 


THR 


No 


ACG 


ACC 


THR 


Yes 


ACT 


ACC 


THR 


Yes 


ACU 


ACC 


THR 


Yes 


AGA 


AGG 


ARG 


Yes 


AGC 


AGC 


SER 


No 


AGG 


AGG 


ARG 


No 


AGT 


AGC 


SER 


Yes 


AGU 


AGC 


SER 


Yes 


ATA 


ATC 


ILE 


Yes 


ATC 


ATC 


ILE 


No 


ATG 


ATG 


MET 


No 


ATT 


ATC 


ILE 


Yes 


ADA 


ATC 


ILE 


Yes 


AUC 


ATC 


ILE 


No 


AUG 


ATG 


MET 


NO 


AUU 


ATC 


ILL 


i es 


PA A 


CAG 


GLN 


Yes 


CAC 


CAC 


HIS 


No 


CAG 


CAG 


GLN 


No 


CAT 


CAC 


HIS 


Yes 


| CAU 


CAC 


HIS 


Yes ! 


CCA 


CCC 


PRO 


Yes 


CCC 


CCC 


PRO 


No 


| CCG 


CCC 


PRO 


Yes 


CCT 


CCC 


PRO 


Yes 


ecu 


CCC 


PRO 


Yes 


CGA 


CGC 


ARG 


Yes 



o 
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Input Codon 


Output Codon 


Amino Acid 


Change 


CGC 


CGC 


ARG | 


No 


CGG 


CGC 


ARG 


Yes 


CGT 


CGC 


ARG 


Yes i 


CGU 


CGC 


ARG 


Yes 


CTA 


CTG 


LEU j 


Yes 


CTC 


CTG 


LEU 


Yes 


CTG 


CTG 


LEU 


No 


CTT 


CTG 


LEU 


Yes 


CUA 


CTG 


LEU 


Yes 


cue 


CTG 


LEU 


Yes 


CUG 


CTG 


LEU 


No 


CUU 


CTG 


LEU 


Yes 


GAA 


GAG 


GLU 


Yes 


GAC 


GAC 


ASP 


No 


GAG 


GAG 


. GLU 


No 


GAT 


GAC 


ASP 


Yes 


GAU 


GAC 


ASP 


Yes 


GCA 


GCC 


ALA 


Yes 


GCC 


GCC 


ALA 


No 


GCG 


GCC 


ALA 


Yes 


GCT 


GCC 


ALA 


Yes 


GCU 


GCC 


ALA 


Yes 


GGA 


GGC 


GLY 


Yes 


GGC 


GGC 


GLY 


No 


GGG 


GGG 


GLY 


No 


GGT 


GGC 


GLY 


Yes 


GGU 


GGC 


GLY 


Yes 


GTA 


GTG 


VAL 


Yes 


GTC 


GTG 


VAL 


Yes 


GTG 


GTG 


VAL 


No 


GTT 


GTG 


VAL 


Yes 


GUA 


GTG 


VAL 


Yes 


GUC 


GTG 


VAL 


Yes 


GUG 


GTG 


VAL 


No 


GUU 


GTG 


VAL 


Yes 


TAA 


TAA 


XXX 


No 


TAC 


TAC 


TYR 


No 


TAG 


TAG 


XXX 


No 


TAT 


TAC 


TYR 


Yes 


TCA 


TCC 


SER 


Yes 
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Input Codon 


Output Codon 


Amino Acid 


Change 


TCC 


TCC 


SER 


No ! 


TCG 


TCC 


SER 


Yes- 


TCT 


TCC 


SER 


Yes 


TGA 


TGA 


XXX 


No 


TGC 


TGC 


CYS 


No 


TGG 


TGG 


TRP 


No 


TGT 


TGT 


CYS 


No 


TTA 


CTG 


LEU 


Yes 


TTC 


TTC 


PHE 


No 


TTG 


CTG 


LEU 


Yes 


TTT 


TTC 


PHE 


No 


UAA 


TAA 


XXX 


No 


UAC 


TAC 


TYR 


No | 


UAG 


TAG 


XXX 


No 


UAU 


TAC 


TYR 


Yes 


UCA 


TCC 


SER 


Yes 


UCC 


TCC 


SER 


No 


UCG 


TCC 


SER 


Yes 


UCU 


TCC 


SER 


Yes 


UGA 


TGA 


XXX 


No 


UGC 


TGC 


CYS 


No 


UGG 


TGG 


TRP 


No 


UGU 


TGT 


CYS 


No 


UUA 


CTG 


LEU 


Yes 


UUC 


TTC 


PHE 


No 


UUG 


CTG 


LEU 


Yes 


UUU 


TTC 


PHE 


Yes 



o 
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WH aT 15 PT.ATMED TS: 

1. A method for determining the t ranslat ional efficiency of 
an individual codon in a cell, said method comprising: 

- introducing into said cell a synthetic construct 
5 comprising a reporter polynucleotide fused in frame with a 

tandem repeat of said individual codon, wherein said 
reporter polynucleotide encodes a reporter protein, and 
wherein said synthetic construct is operably linked to a 
regulatory polynucleotide; and 

10 - measuring expression of said reporter protein in said 

cell to determine the translat ional efficiency of said 
codon. 

2. The method of claim 1, further comprising comparing: 

- expression of said reporter protein in a cell to 
15 which a synthetic construct comprising a tandem repeat of 

said individual codon was provided; and 

- expression of said reporter protein in a cell to 
which a synthetic construct comprising a tandem repeat of 
another individual codon was provided; 

20 to thereby determine the relative translat ional efficiency of 
said individual codons in said cell. 

3. The method of claim 1, further comprising comparing: 

- expression of said reporter protein in a cell to 
which a synthetic construct comprising a tandem repeat of 

25 said individual codon was provided; and 

- expression of said reporter protein in another cell 
to which a synthetic construct comprising a tandem repeat 
of said individual codon was provided; 

to thereby determine the translational efficiency of said 
30 individual codon in said cell relative to said other cell. 
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4. The method of claim 1, further comprising: 

-introducing the synthetic construct into a progenitor 
cell of said cell; and 

- producing said cell from said progenitor cell; 
wherein said cell contains said synthetic construct. 

5. The method of claim 1, further comprising 

- introducing the synthetic construct into a progenitor 
of said cell; and 

- growing an organism or part thereof from said 
progenitor cell; 

wherein said organism comprises said cell containing said 
synthetic construct . 

6. The method of claim 1, further comprising 

- introducing the synthetic construct into an organism 
or part thereof such that said synthetic construct is 
introduced into said cell. 

7 . A synthetic construct ' comprising a reporter 
polynucleotide fused in frame with a tandem repeat of 
individual codons, wherein said reporter polynucleotide 
encodes a reporter protein, and wherein said synthetic 
construct is operably linked to a regulatory polynucleotide. 

8. A vector comprising the synthetic construct of claim 7. 

9. A cell comprising the synthetic construct of claim 7. 

10. A cell comprising the vector of claim 9. 

11. An improved method of constructing a synthetic 
polynucleotide from which a protein is selectively 
expressible in a target cell of an organism, relative to 
another cell of the organism, said method comprising: 

- selecting a first codon of a parent polynucleotide 
for replacement with a synonymous codon which has a higher 
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translational efficiency in said target cell than in said 
other cell; and 

- replacing said first codon with said synonymous 
codon to form said synthetic polynucleotide, wherein said 
first codon and said synonymous codon are selected by: 

- comparing translational efficiencies of 
individual codons in said target cell relative to said 
other cell; and 

- selecting said first codon and said synonymous 
codon based on said comparison, wherein said comparison 
comprises ; 

- introducing into said target cell and said other 
cell a synthetic construct comprising a reporter 
polynucleotide fused in frame with a tandem repeat of 
an individual codon, wherein said reporter 
polynucleotide encodes a reporter protein, and wherein 
said synthetic construct is operably linked to a 
regulatory polynucleotide; and 

- comparing expression of said reporter protein in 
said target cell relative to said other cell; 

to thereby determine the translation efficiency of individual 
codons in said target cell relative to said other cell. 

12. The method of claim 11, wherein said synonymous codon 
corresponds to a reporter construct from which the reporter 
protein is expressed in said target cell at a level that is 
at least 110% of that expressed from the said reporter 
construct in said other cell. 

13. An improved method of constructing a synthetic 
polynucleotide from which a protein is expressible in a 
target cell of an organism at a higher level than from a 
parent polynucleotide expressing said protein, said method 
comprising : 
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- selecting a first codon of the parent polynucleotide 
for replacement with a synonymous codon which has a higher 
translational efficiency in said target cell than said 
first codon; 

5 _ replacing said first codon with said synonymous 

codon to form said synthetic polynucleotide, wherein said 
first codon and said synonymous codon are selected by: 

- comparing translational efficiencies of 
different individual codons in said target cell; and 

10 - selecting said first codon and said synonymous 

codon based on said comparison wherein said comparison 
comprises : 

- introducing into a target cell a synthetic 
construct comprising a reporter polynucleotide fused in 

15 frame with a tandem repeat of an individual codon, 

wherein said reporter polynucleotide encodes a reporter 
protein, and wherein said synthetic construct is 
operably linked to a regulatory polynucleotide; 

- introducing into a target cell a different 
20 synthetic construct comprising the reporter 

polynucleotide fused in frame with a tandem repeat of 
another individual codon; and 

- comparing expression of said reporter protein 
from each synthetic construct in said target cell; 

25 to thereby determine the translation efficiency of individual 
codons in said target cell. 

14. The method of claim 13, wherein said synonymous codon 
corresponds to a reporter construct from which the reporter 
protein is expressed in said target cell at a level that is 
30 at least 110% of that expressed from the different reporter 
construct corresponding to said first codon. 
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15. A synthetic polynucleotide constructed according to the 
method of claim 11 or claim 13. 

16. A vector comprising the synthetic polynucleotide of claim 
15. 

17. A cell comprising the synthetic polynucleotide of claim 
15. 

18. A cell comprising the vector of claim 16. 
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<110> The University of Queensland (all designated States except US) 
Frazer, Ian Hector and Zhou, Jian (US only) 

<120> METHOD AND POLYNUCLEOTIDES FOR DETERMINING TRANSLATIONAL 
EFFICIENCY OF A CODON 

<130> Codon optimization 

<14 0> Not yet assigned 
<141> Herewith 

<150> PP8078 
<151> 1999-01-08 

<160> 180 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description oJ 

<220> 
<221> CDS 
<222> (1) . . (732) 

<400> 1 

atg age age age age 
Met Ser Ser Ser Ser 
1 5 

gtc cca att etc gtg 
Val Pro lie Leu Val 
20 

tct gtc age gga gag 
Ser Val Ser Gly Glu 
35 

ctg aaa ttc ate tgc 
Leu Lys Phe lie Cys 
50 

ctg gtc act ace ttc 
Leu Val Thr Thr Phe 
65 

gac cat atg aag cag 
Asp His Met Lys Gin 
85 

tat gtg cag gag aga 
Tyr Val Gin Glu Arg 
100 

ace cgc get gaa gtc 



f Artificial Sequence: Ala(GCA)5GFP 



age age aag ggc gag gaa ctg ttc act ggc gtg 
Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
10 15 



48 



gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
25 30 

ggt gaa ggt gat gee aca tac gga aag etc ace 144 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
40 4 5 

ace act gga aag etc cct gtg cca tgg cca aca 192 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
55 60 

tct tat ggc gtg cag tgc ttt tec aga tac cca 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

70 75 80 

cat gac ttt ttc aag age gec atg ccc gag ggc 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
90 95 

ace ate ttt ttc aaa gat gac ggg aac tac aag 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
105 HO 

aag ttc gaa ggt gac ace ctg gtg aat aga ate 384 



240 



288 



336 
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432 



480 



Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 ~ 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 " 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ^ 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



624 



672 



720 



732 



<210> 2 
<211> 243 
<212> PRT 
<213> Artificial 

<400> 2 

Met Ser Ser Ser 
1 

Val Pro He Leu 
20 

Ser Val Ser Gly 
35 

Leu Lys Phe lie 
50 

Leu Val Thr Thr 
65 

Asp His Met Lys 



Tyr Val Gin Glu 
100 

Thr Arg Ala Glu 
115 



Sequence 



Ser Ser Ser Lys 
5 

Val Glu Leu Asp 



Glu Gly Glu Gly 
40 



Cys Thr Thr Gly 
55 

Phe Ser Tyr Gly 
70 

Gin His Asp Phe 
85 

Arg Thr He Phe 



Val Lys Phe Glu 
120 



Gly Glu Glu Leu 
10 

Gly Asp Val Asn 
25 

Asp Ala Thr Tyr 



Lys Leu Pro Val 
60 

Val Gin Cys Phe 
75 

Phe Lys Ser Ala 
90 

Phe Lys Asp Asp 
105 

Gly Asp Thr Leu 



Phe Thr Gly Val 
15 

Gly His Lys Phe 
30 

Gly Lys Leu Thr 
45 

Pro Trp Pro Thr 



Ser Arg Tyr Pro 
80 



Met Pro Glu Gly 
95 



Gly Asn Tyr Lys 
110 

Val Asn Arg He 
125 
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Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 1*75 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



48 



96 



<210> 3 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ala (GCC) 5GFP 

<220> 
<221> CDS 
<222> (1) . . (732) 

<400> 3 

atg gcc gcc gcc gcc gcc age aag ggc gag gaa ctg ttc act ggc gtg 

Met Ala Ala Ala Ala Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gcc aca tac gga aag etc ace 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 " 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gcc atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 



240 



288 



336 
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100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 

130 " 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 * 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn He 

165 " 170 175 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



528 



576 



624 



672 



720 



732 



<210> 4 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Met Ala Ala Ala Ala Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1.5 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 ~ 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 
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Thr Arq Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 " 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 1^0 

Lys Leu' Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 1*75 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 5 

<211> 732 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ala(GCG)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 5 

atg gcg gcg gcg gcg gcg age aag ggc gag gaa ctg ttc act ggc gtg 

Met Ala Ala Ala Ala Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 



48 



96 



144 



192 



240 



288 
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tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tvr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 



145 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



336 



384 



4 32 



acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly Hxs 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lvs Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

150 155 160 



528 



576 



62 4 



672 



720 



732 



<210> 6 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 6 

Met Ala Ala Ala Ala Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 
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Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 " - 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lvs Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 ~ 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 7 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ala(GCT)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 



<400> 7 

atg get get get get get age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Ala Ala Ala Ala Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 



96 



tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 14 4 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



192 



ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 
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gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tvr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lvs Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 

<210> 8 
<211> 243 
<212> PRT 

<213> Artificial Sequence 

<400> 8 ty . 

Met Ala Ala Ala Ala Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 



336 



384 



acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 • 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 ' 135 140 



480 



528 



576 



624 



672 



720 



732 
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Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys lie Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 9 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arg (AGA) 5GFP 

<220> 
<221> CDS 
<222> (1) . . (732) 

<400> 9 

atg aga aga aga aga aga age 
Met Arg Arg Arg Arg Arg Ser 
1 5 

gtc cca att etc gtg gaa ctg 

Val Pro He Leu Val Glu Leu 
20 

tct gtc age gga gag ggt gaa 

Ser Val Ser Gly Glu Gly Glu 
35 

ctg aaa ttc ate tgc ace act 

Leu Lys Phe lie Cys Thr Thr 

50 55 



aag ggc gag gaa ctg ttc act ggc gtg 48 
Lys Gly Glu Glu Leu Phe Thr Gly Val 
10 15 

gat ggc gat gtg aat ggg cac aaa ttt 96 
Asp Gly Asp Val Asn Gly His Lys Phe 
25 30 

ggt gat gec aca tac gga aag etc ace 144 
Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
40 45 

gga aag etc cct gtg cca tgg cca aca 192 
Gly Lys Leu Pro Val Pro Trp Pro Thr 
60 



ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
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Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
— - ~~ 75 80 



65 70 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 



150 155 160 



ctg tac aag tga 
Leu Tyr Lys 

<210> 10 
<211> 243 
<212> PRT 

<213> Artificial Sequence 

<400> 10 t t n 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 I 5 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 



288 



gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 



336 



384 



480 



528 



576 



145 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 

165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 

225 230 235 240 



672 



720 



732 
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Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 ~ 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys lie Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 11 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arg (AGG) 5GFP 

<220> 
<221> CDS 
<222> (1) . . (732) 

<400> 11 

atg agg agg agg agg agg age aag ggc 
Met Arg Arg Arg Arg Arg Ser Lys Gly 
1 5 

gtc cca att etc gtg gaa ctg gat ggc 
Val Pro He Leu Val Glu Leu Asp Gly 
20 25 

tct gtc age gga gag ggt gaa ggt gat 
Ser Val Ser Gly Glu Gly Glu Gly Asp 
35 40 

ctg aaa ttc ate tgc acc act gga aag 
Leu Lys Phe He Cys Thr Thr Gly Lys 



gag gaa ctg ttc 
Glu Glu Leu Phe 
10 

gat gtg aat ggg 
Asp Val Asn Gly 



gee aca tac gga 
Ala Thr Tyr Gly 
45 

etc cct gtg cca 
Leu Pro Val Pro 



act ggc gtg 48 
Thr Gly Val 
15 

cac aaa ttt 96 
His Lys Phe 
30 

aag etc acc 144 
Lys Leu Thr 



tgg cca aca 192 
Trp Pro Thr 
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50 55 60 



ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

qac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt tec aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 * 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc eac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lvs Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 1*75 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



732 



<210> 12 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 12 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 " 40 45 



WO 00/42215 



- xiii - 



PCT/AUOO/00008 



Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 J 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 13 

<211> 732 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arg (CGA) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 13 

atg cga cga cga cga cga age aag ggc gag gaa ctg ttc act ggc gtg 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 " " 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 



48 



96 



144 



WO 00/42215 



- xiv - 



PCT/AU00/00008 



ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



192 



240 



288 



tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 



384 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 " 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



480 



528 



576 



624 



720 



732 



<210> 14 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 14 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 " 5 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



WO 00/42215 



- XV - 



PCT/AUOO/00008 



Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 - 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 " 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys lie Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ' 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 15 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arg(CGC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 15 

atg cgc cgc cgc cgc cgc age aag ggc gag gaa ctg ttc act ggc gtg 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



WO 00/42215 



- xvi - 



PCT/AU00/00008 



tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 144 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

• 35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

50 55 60 



ctg tac aag tga 
Leu Tyr Lys 



<210> 16 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



192 



ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe. Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 " 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



288 



336 



384 



432 



480 



528 



576 



62 4 



672 



732 



<400> 16 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 - 5 10 15 



WO 00/42215 



- xvii - 



PCT/AU00/00008 



Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 " 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 ^ 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 17 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arg (CGG) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 17 

atg egg egg egg egg egg age aag ggc gag gaa ctg ttc act ggc gtg 4 8 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 



WO 00/42215 



- xviii - 



PCT/AU00/00008 



Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 

20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 - 120 . 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 ' 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lvs Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

150 155 160 



145 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 J 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 

<210> 18 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



144 



192 



240 



288 



336 



384 



432 



480 



528 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



62 4 



672 



720 



732 



WO 00/42215 



- xix - 
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Met°Arq 8 Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1*5 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 " 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 - 90 95 

Tvr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 " 135 140 

Lvs Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 19 
<211> 732 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Arg (CGT) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 19 

atg cgt cgt cgt cgt cgt age aag ggc gag gaa ctg ttc act ggc gtg 
Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 



WO 00/42215 



- xx - 



PCT/AU00/00008 



! 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 " 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



624 



672 



720 



732 



<210> 20 
<211> 243 



WO 00/42215 



- xxi - 



PCT/AUOO/00008 



<212> PRT 

<213> Artificial Sequence 
<400> 20 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 " 5 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 " 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ~ 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 21 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Asn(AAC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 



WO 00/42215 



- xxii - 
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48 



96 



144 



192 



240 



288 



384 



<400> 21 . 

atg aac aac aac aac aac age aag ggc gag gaa ctg ttc act ggc gtg 

Met Asn Asn Asn Asn Asn Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctq gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 "* 105 HO 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



432 



480 



528 



624 



672 



720 



ctg tac aag tga 
Leu Tyr Lys 



732 



WO 00/42215 



PCT/AU00/00008 



- xxin - 



<210> 22 
<211> 243 
<212> PRT 



<213> Artificial Sequence 



<400> 22 

Met Asn Asn Asn Asn 
1 5 



Asn 



Ser Lys Gly Glu Glu Leu 
10 



Phe Thr Gly Val 
15 



Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lvs Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ^ 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 23 
<211> 732 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Asn (AAT) 5GFP 



<220> 



WO 00/42215 



- xxiv - 



PCT/AU00/00008 



<221> CDS 
<222> (1) . . (732) . 

<400> 23 

atg aat aat aat aat aat age aag ggc gag gaa ctg ttc act ggc gtg 

Met Asn Asn Asn Asn Asn Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly, Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 



ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 ~ 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 



115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 " 135 140 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 

225 230 235 240 



48 



96 



144 



240 



288 



336 



384 



432 



480 



528 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

145 ** 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 576 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 . 190 



624 



672 



720 



WO 00/42215 



- xxv - 



PCT/AU00/00008 



ctg tac aag tga 
Leu Tyr Lys 

<210> 24 
<211> 243 
<212> PRf 

<213> Artificial Sequence 

Met°Asn 4 Asn Asn Asn Asn Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 ' 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 ' 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 . 230 235 240 

Leu Tyr Lys 

<210> 25 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 



732 



WO 00/42215 



- xxvi ~ 



PCT/AU00/00008 



<223> Description of Artificial Sequence: Asp(GAC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 25 

atg gac gac gac gac gac age aag ggc gag gaa ctg ttc act ggc gtg 

Met Asp Asp Asp Asp Asp Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 . 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 " 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 „ 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 



145 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lvs Gin Lys Asn Giy He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 



gag 



gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 



48 



96 



144 



192 



240 



288 



336 



384 



4 32 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

150 155 160 



528 



576 



62 4 



672 



720 



WO 00/42215 



- xxvii - 



PCT/AU00/00008 



Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 26 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 26 

Met Asp Asp Asp Asp Asp Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 27 
<211> 732 
<212> DNA 



WO 00/42215 



- xxviii - 



PCT/AU00/00008 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Asp(GAT)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 27 

atg gat gat gat gat gat age aag ggc gag gaa ctg ttc act ggc gtg 
Met Asp Asp Asp Asp Asp Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 " 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 175 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 



48 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 144 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 HO 



240 



288 



384 



4 32 



480 



528 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



624 



672 



WO 00/42215 



- xxix - 



PCT/AUOO/00008 



210 



215. 



220 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



720 



732 



<210> 28 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 28 

Met Asp Asp Asp Asp Asp Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 .10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ■ 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



Leu Tyr Lys 



WO 00/42215 



- xxx - 



PCT/AUOO/00008 



<210> 29 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cys(TGC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 29 

atg tgc tgc tgc tgc tgc age aag ggc gag gaa ctg ttc act ggc gtg 

Met Cys Cys Cys Cys Cys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 ' 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 

165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 4 32 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 



480 



528 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ^ 200 205 



624 



WO 00/42215 



- xxxi - 



PCT/AU00/00008 



cag tct gcc ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



ctg tac aag tga 
Leu Tyr Lys 



672 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



732 



<210> 30 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 30 

Met Cys Cys Cys Cys Cys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 " 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ~ 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



WO 00/42215 



- xxxii - 



PCT/AUOO/00008 



Leu Tyr Lys 

<210> 31 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cys{TGT)5GFP 

<220> 

<221> CDS 

<222> (1) - . (732) 

<400> 31 

atg tgt tgt tgt tgt tgt age aag ggc gag gaa ctg ttc act ggc gtg 

Met Cys Cys Cys Cys Cys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
I 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



gag 



^„ gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



WO 00/42215 



- xxxiii - 



PCT/AU00/00008 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 



624 



672 



720 



ctg tac aag tga 732 
Leu Tyr Lys 

<210> 32 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 32 

Met Cys Cys Cys Cys Cys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 " 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 J 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 " 215 220 



WO 00/42215 



- xxxiv - 



PCT/AU00/00008 



Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 33 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gln(CAA)5GFP 



<220> 
<221> CDS 
<222> (1) . 



(732) 



<400> 33 

atg caa caa caa caa caa age aag ggc gag gaa ctg ttc act ggc gtg 

Met Gin Gin Gin Gin Gin Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 - 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 " 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
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Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180. 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 



ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 



62 4 



cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



720 



ctg tac aag tga 732 
Leu Tyr Lys 

<210> 34 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 34 

Met Gin Gin Gin Gin Gin Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 
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Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 35 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gln(CAG)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 35 

atg cag cag cag cag cag age aag ggc gag gaa ctg ttc act ggc gtg 
Met Gin Gin Gin Gin Gin Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 ~ 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 ' 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 



48 



96 



tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



192 



240 



288 



336 



384 



432 



480 



528 
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165 170 175 

gag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 576 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 



cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 

225 230 235 240 



672 



720 



ctg tac aag tga 732 
Leu Tyr Lys 



<210> 36 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 36 

Met Gin Gin Gin Gin Gin Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 ^ 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 " 135 ^ 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 ~ 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 
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lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 37 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Glu (GAA) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 37 

atg gaa gaa gaa gaa gaa age aag ggc gag gaa ctg ttc act ggc gtg 

Met Glu Glu Glu Glu Glu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 110 



48 



96 



144 



192 



240 



288 



336 



ace cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 4 32 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 " 135 140 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 



480 



WO 00/42215 



- xxxix - 



PCT/AU00/00008 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
Y 165 170 175 

gag gat gga tec gtg cag ctg gee gac. cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

< 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg "ace get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



528 



576 



62 4 



67 2 



720 



732 



<210> 38 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 38 

Met Glu Glu Glu Glu Glu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 ' 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 
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5 

Glu Asp Gly Ser Val 
180 . 

lie Gly Asp Gly Pro 
195 

Gin Ser Ala Leu Ser 
210 

Leu Leu Glu Phe Val 
225 

Leu Tyr Lys 



Gin Leu Ala Asp His Tyr 
185 



Val Leu Leu Pro Asp Asn 
200 



Lys Asp Pro Asn Glu Lys 
215 



Thr Ala Ala Gly He Thr 
230 235 



Gin Gin Asn Thr Pro 
190 

His Tyr Leu Ser Thr 
205 

Arg Asp His Met Val 
220 

His Gly Met Asp Glu 
240 



<210> 39 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Glu(GAG)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 39 

atg gag gag gag gag gag age aag ggc gag gaa ctg ttc act ggc gtg 
Met Glu Glu Glu Glu Glu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



48 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



144 



192 



240 



288 



336 



384 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 
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aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 40 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 40 

Met Glu Glu Glu Glu Glu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 
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Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

/' 

<210> 41 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gly (GGA) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 41 

atg gga gga gga gga gga age aag ggc gag gaa ctg ttc act ggc gtg 

Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 



tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 



48 



96 



144 



ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 " 55 60 



240 



gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 



336 



384 



432 
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Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 

130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 175 



480 



528 



576 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



624 



672 



720 



<210> 42 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 42 

Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1.5 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 
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Lvs Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 43 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gly(GGC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 43 

atg ggc ggc ggc ggc ggc age aag ggc gag gaa ctg ttc act ggc gtg 4b 
Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 



1 



5 J 10 15 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



144 



ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 HO 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 



192 



240 



288 



336 



384 
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115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 ' 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 17 5 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 



432 



480 



528 



576 



624 



672 



720 



ctg tac aag tga 732 
Leu Tyr Lys 

<210> 44 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 44 

Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 
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Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



Leu Tyr Lys 



<210> 45 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gly (GGG ) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 45 

atg ggg ggg ggg ggg ggg age aag ggc gag gaa ctg ttc act ggc gtg 
Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 * 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 



48 



96 



144 



192 



240 



288 



tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 
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acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 " 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gec -gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ' 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



384 



432 



480 



528 



576 



624 



'672 



720 



732 



<210> 46 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 46 

Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 
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Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 ~ 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 "* 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 47 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gly (GGT) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 47 

atg ggt ggt ggt ggt ggt age aag ggc gag gaa ctg ttc act ggc gtg 

Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 " 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe v lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

50 " 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 



48 



96 



tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 ~ 40 45 



192 



240 



288 
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tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 



acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 ' 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
■Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 

<210> 48 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 48 

Met Gly Gly Gly Gly Gly Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 ' 5 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 



384 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 4 32 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 



480 



528 



576 



624 



672 



720 



732 
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Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys lie Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala' Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 49 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: His(CAC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 49 

atg cac cac cac cac cac age aag ggc gag gaa ctg ttc act ggc gtg 

Met His His His His His Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 



48 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 144 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 45 



192 



ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 



gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 
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Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tvr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 ( 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 ' 1*70 175 

qag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

cag tct gec ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



336 



384 



432 



480 



528 



576 



624 



672 



720 



732 



<210> 50 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 50 

Met His His His His His Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 
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Asp His Met Lys 



Tyr Val Gin Glu 
100 

Thr Arg Ala Glu 
115 

Glu Leu Lys Gly 
130 

Lys Leu Glu Tyr 
145 

Lys Gin Lys Asn 



Glu Asp Gly Ser 
180 

lie Gly Asp Gly 
195 

Gin Ser Ala Leu 
210 

Leu Leu Glu Phe 
225 

Leu Tyr Lys 



Gin His Asp Phe 

. 85 

Arg Thr lie Phe 



Val Lys Phe Glu 
120 



lie Asp Phe Lys 
135 

Asn Tyr Asn Ser 
150 

Gly lie Lys Val 
165 

Val Gin Leu Ala 



Pro Val Leu Leu 
200 



Ser Lys Asp Pro 
215 

Val Thr Ala Ala 
230 



Phe Lys Ser Ala 
90 

Phe Lys Asp Asp 
105 

Gly Asp Thr Leu 



Glu Asp Gly Asn 
140 



His Asn Val Tyr 
155 

Asn Phe Lys lie 
170 

Asp His Tyr Gin 
185 

Pro Asp Asn His 



Asn Glu Lys Arg 
220 

Gly He Thr His 
235 



Met Pro Glu Gly 
95 



Gly Asn Tyr Lys 
110 

Val Asn Arg He 
125 

He Leu Gly His 



He Met Ala Asp 
160 



Arg His Asn He 
175 

Gin Asn Thr Pro 
190 

Tyr Leu Ser Thr 
205 

Asp His Met Val 



Gly Met Asp Glu 
240 



<210> 51 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: His(CAT)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 51 

atg cat cat cat cat cat age aag ggc gag gaa ctg ttc act ggc gtg 
Met His His His His His Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 



48 



96 



tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 14 4 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 



192 



240 
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65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 



acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
165 170 175 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



288 



tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 



384 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 



480 



528 



576 



624 



672 



720 



732 



<210> 52 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 52 

Met His His His His His Ser Lys 
1 5 

Val Pro He Leu Val Glu Leu Asp 
20 

Ser Val Ser Gly Glu Gly Glu Gly 

35 40 



Gly Glu Glu Leu Phe Thr Gly Val 
10 15 

Gly Asp Val Asn Gly His Lys Phe 
25 30 

Asp Ala Thr Tyr Gly Lys Leu Thr 
45 



Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 
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Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys lie Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 53 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ile(ATA)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 53 

atg ata ata ata ata ata age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met He He He He He Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 



96 



144 



192 
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ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



288 



336 



384 



ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 "* 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



624 



672 



720 



732 



<210> 54 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 54 

Met He He He He He Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



WO 00/42215 



- Ivi - 
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Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 ~ 150 155 160 

Lys Gin Lys' Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 55 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ile(ATC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 55 

atg ate ate ate ate ate age aag ggc gag gaa ctg ttc act ggc gtg 

Met He He He He He Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



WO 00/42215 
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ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly Hxs 
130 135 140 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

145 ' 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 

165 170 175 

gag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 

225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



240 



288 



336 



384 



480 



528 



576 



624 



672 



720 



732 



<210> 56 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 56 

Met He He He He He Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 



Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



WO 00/42215 
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Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 "70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 57 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: He (ATT) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 57 

atg att att att att att age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met He He He He He Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 14 4 



WO 00/42215 
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Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctq aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctq gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

qac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tvr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 " 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

150 155 160 



145 

aag caa aag 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ' 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 

\ 

<210> 58 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 58 

Met He He lie lie He Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



240 



288 



336 



384 



480 



528 



aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lvs Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



624 



672 



720 



732 



WO 00/42215 
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Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 . 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 ~ 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ~ 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 59 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Leu(CTA)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 59 

atg eta eta eta eta eta age aag ggc gag gaa ctg ttc act ggc gtg 48 
Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
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20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 "* 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He 

165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 ~ 215 220 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 



672 



ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



ctg tac aag tga 
Leu Tyr Lys 



732 



<210> 60 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



<400> 60 



WO 00/42215 
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Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1.5 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 ' 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 61 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Leu(CTC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 61 

atg etc etc etc etc etc age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



WO 00/42215 



lxiii 
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gtc cca att etc. gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



96 



tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 14 4 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt • ttc aag age gee atg ccc gag ggc 288 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 4 32 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 

130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 

165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 " 200 205 



cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



ctg tac aag tga 
Leu Tyr Lys 



528 



576 



672 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



732 



<210> 62 
<211> 243 
<212> PRT 



WO 00/42215 
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<213> Artificial Sequence 
<400> 62 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 " 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 63 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Leu(CTG)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 



<400> 63 



WO 00/42215 
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atg ctg ctg ctg ctg ctg age aag ggc gag gaa ctg ttc act ggc gtg 
Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



ace cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 ' 150 155 160 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 1 200 205 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



48 



96 



tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 



240 



288 



384 



432 



480 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
165 " 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



624 



cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



720 



732 



WO 00/42215 



- lxvi - 



PCT/AU00/00008 



<210> 64 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 64 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 J 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 65 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Leu(CTT)5GFP 

<220> 
<221> CDS 



WO 00/42215 



- lxvii - 



PCT/AU00/00008 



<222> (1) . . (732) 
<400> 65 

atg ctt ctt ctt ctt ctt age aag ggc gag gaa ctg ttc act ggc gtg 
Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 " 135 140 



48 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 

20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 14 4 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



240 



288 



432 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ^ 200 ~ 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



ctg tac aag tga 



732 



WO 00/42215 



- lxviii - 



PCT/AUOO/00008 



Leu Tyr Lys 

<210> 66 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 66 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 " 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 67 
<211> 732 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Leu (TTA) 5GFP 



WO 00/42215 



- Ixix - 



PCT/AUOO/00008 



<220> 

<221> CDS 

<222> (1) . . (732) 



<400> 67 

atg tta tta tta tta tta age aag ggc gag gaa ctg ttc act ggc gtg 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 



48 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 

20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 144 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 ~ 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 ~ 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 14 0 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



240 



288 



336 



384 



4 32 



480 



528 



576 



624 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 



720 



WO 00/42215 



- lxx - 



PCT/AUOO/00008 



225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 

<210> 68 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 68 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 69 
<211> 732 
<212> DNA 

<213> Artificial Sequence 



732 



WO 00/42215 PCT/AUOO/00008 

- lxxi - 



<220> 

<223> Description of Artificial Sequence: Leu (TTG) 5GFP 

<220> 
<221> CDS 
<222> (1) . . (732) 

<400> 69 

atg ttg ttg ttg ttg ttg age aag ggc gag gaa ctg ttc act ggc gtg 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 



35 4 0 4 5 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 .90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 110 



48 



96 



192 



240 



288 



336 



480 



528 



ace cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 ^ 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 * 200 205 

cag tct gec ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



WO 00/42215 



- lxxii - 



PCT/AU00/00008 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 



720 



ctg tac aag tga 
Leu Tyr Lys 



732 



<210> 70 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 70 

Met Leu Leu Leu Leu Leu Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 HO 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 17 5 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 71 



WO 00/42215 



- lxxiii - 



PCT/AUOO/00008 



<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Lys (AAA) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 71 

atg aaa aaa aaa aaa aaa age aag ggc gag gaa ctg ttc act ggc gtg 

Met Lys Lys Lys Lys Lys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



48 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 

20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 14 4 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 336 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 110 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 384 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 

130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 " 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 



432 



WO 00/42215 



- lxxiv - 



PCT/AU00/00008 



cag tct gcc ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



672 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 72 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 72 

Met Lys Lys Lys Lys Lys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 4 14 0 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 . 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 



Leu Tyr Lys 



WO 00/42215 



- Ixxv - 



PCT/AU00/00008 



48 



96 



<210> 73 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Lys (AAG) 5GFP 

<220> 
<221> CDS 
<222> (1) . . (732) 

<400> 73 

atg aag aag aag aag aag age aag ggc gag gaa ctg ttc act ggc gtg 

Met Lys Lys Lys Lys Lys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 ** 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 i 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



192 



240 



288 



336 



384 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 * 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 



624 



WO 00/42215 



- lxxvi - 



PCT/AU00/00008 



lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 

cag tct gcc ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 

225 230 235 240 



ctg tac aag tga 
Leu Tyr Lys 

<210> 74 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 74 

Met Lys Lys Lys Lys Lys Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 ~ 1 5 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 



732 
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225 230 235 240 

Leu Tyr Lys 

<210> 75. 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Phe (TTT) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 75 

atg ttt ttt ttt ttt ttt age aag ggc gag gaa ctg ttc act ggc gtg 

Met Phe Phe Phe Phe Phe Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 



48 



96 



144 



ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 



336 



acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 

130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
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180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 



624 



cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 



720 



ctg tac aag tga 732 
Leu Tyr Lys 

<210> 76 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 76 

Met Phe Phe Phe Phe Phe Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
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210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 77 

<211> 732 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Phe (TTC) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 77 

atg ttc ttc ttc ttc ttc age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Phe Phe Phe Phe Phe Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 4 32 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 ~ 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 ~ 17 5 
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gag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 ~ 200 205 

cag tct gec ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 78 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 78 

Met Phe Phe Phe Phe Phe Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
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195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 ' 240 

Leu Tyr Lys 

<210> 79 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pro(CCC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 79 

atg ccc ccc ccc ccc ccc age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Pro Pro Pro Pro Pro Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 " 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro, 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 " 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 
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aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn. Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr*His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 12>2 
Leu Tyr Lys 



576 



624 



672 



720 



<210> 80 
<211>' 243 
<212> PRT 

<213> Artificial Sequence 
<400> 80 

Met Pro Pro Pro Pro Pro Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
t 20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 " 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys lie Arg His Asn He 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
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180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 81 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pro(CCG)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 81 

atg ccg ccg ccg ccg ccg age aag ggc gag gaa ctg ttc act ggc gtg 4 8 

Met Pro Pro Pro Pro Pro Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 * 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 4 32 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 480 
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Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 175 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 



576 



ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 " 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 

<210> 82 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 82 

Met Pro Pro Pro Pro Pro Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr. Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
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165 

Glu Asp Gly Ser Val Gin Leu Ala 
180 

lie Gly Asp Gly Pro Val Leu Leu 
195 200 

Gin Ser Ala Leu Ser Lys Asp Pro 
210 215 

Leu Leu Glu Phe Val Thr Ala Ala 
225 230 

Leu Tyr Lys 



170 175 

Asp His Tyr Gin Gin Asn Thr Pro 
185 " 190 

Pro Asp Asn His Tyr Leu Ser Thr 
205 

Asn Glu Lys Arg Asp His Met Val 
220 

Gly lie Thr His Gly Met Asp Glu 
235 240 



<210> 83 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pro(CCT)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 83 

atg cct cct cct cct cct age aag ggc gag gaa ctg ttc act ggc gtg 4 8 

Met Pro Pro Pro Pro Pro Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 14 4 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 90 95 

tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

ace cgc get gaa gtc aag ttc gaa ggt gac ace ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
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130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 720 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 84 

<211> 243 

<212> PRT 

<213> Artificia 

<400> 84 
Met Pro Pro Pro 
1 

Val Pro lie Leu 
20 

Ser Val Ser Gly 
35 

Leu Lys Phe lie 
50 

Leu Val Thr Thr 
65 

Asp His Met Lys 



Tyr Val Gin Glu 
100 

Thr Arg Ala Glu 
115 

Glu Leu Lys Gly 
130 

Lys Leu Glu Tyr 



Sequence 



Pro Pro Ser Lys 
5 

Val Glu Leu Asp 



Glu Gly Glu Gly 
40 

Cys Thr Thr Gly 
5 5 

Phe Ser Tyr Gly 
70 

Gin His Asp Phe 
85 

Arg Thr lie Phe 



Val Lys Phe Glu 
120 

lie Asp Phe Lys 
135 



Asn Tyr Asn Ser 



Gly Glu Glu Leu 
10 

Gly Asp Val Asn 
25 



Asp Ala Thr Tyr 



Lys Leu Pro Val 
60 



Val Gin Cys Phe 
75 

Phe Lys Ser Ala 
90 



Phe Lys Asp Asp 
105 



Gly Asp Thr Leu 



Glu Asp Gly Asn 
140 



His Asn Val Tyr 



Phe Thr Gly Val 
15 

Gly His Lys Phe 
30 

Gly Lys Leu Thr 
4 5 

Pro Trp Pro Thr 



Ser Arg Tyr Pro 
80 

Met Pro Glu Gly 
95 



Gly Asn Tyr Lys 
110 

Val Asn Arg lie 
125 

lie Leu Gly His 



lie Met Ala Asp 
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145 

Lys Gin Lys Asn 



Glu Asp Gly Ser 
180 



lie Gly Asp Gly 
195 

Gin Ser Ala Leu 
210 

Leu Leu Glu Phe 
225 

Leu Tyr Lys 



150 



Gly lie Lys Val 
165 

Val Gin Leu Ala 



Pro Val Leu Leu 
200 

Ser Lys Asp Pro 
215 

Val Thr Ala Ala 
230 



155 

Asn Phe Lys lie 
170 

Asp His Tyr Gin 
185 

Pro Asp Asn His 



Asn Glu Lys Arg 
220 

Gly He Thr His 
235 



160 



Arg His Asn He 
175 

Gin Asn Thr Pro 
190 

Tyr Leu Ser Thr 
205 

Asp His Met Val 



Gly Met Asp Glu 
240 



<210> 85 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Pro(CGA)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 85 

atg cga cga cga cga cga age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 14 4 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 
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gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 " 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 720 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 

225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 86 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 86 

Met Arg Arg Arg Arg Arg Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 ~ 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 ' 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 ' 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
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130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 87 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ser(AGC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 87 

atg age age age age age age aag ggc gag gaa ctg ttc act ggc gtg 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 



gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 



48 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 

20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 14 4 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 



288 



tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 
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acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys. Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 



384 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 



480 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 17 0 17 5 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag^ aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 7 32 

Leu Tyr Lys 



<210> 88 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 88 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 ~ 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
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115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 ~ 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 89 

<211> 732 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ser (AGT) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 89 

atg agt agt agt agt agt age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 " 40 45 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 



tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 



336 
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Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 . 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 * 135 "* 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 



720 



ctg tac aag tga 732 
Leu Tyr Lys 

<210> 90 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 90 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 1 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
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100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 91 
<211> 732 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Ser(TCA)5GFP 



<220> 

<221> CDS 

<222> (1) . . (732) 



<400> 91 

atg tea tea tea tea tea age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 



tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc ace 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 



gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
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85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 

130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 ** 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 " 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 



336 



384 



432 



480 



528 



576 



624 



672 



720 



ctg tac aag tga 7 32 
Leu Tyr Lys 



<210> 92 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 92 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
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Tyr Val Gin Glu 
100 

Thr Arg Ala Glu 
115 

Glu Leu Lys Gly 
130 

Lys Leu Glu Tyr 
145 

Lys Gin Lys Asn 



Glu Asp Gly Ser 
180 



He Gly Asp Gly 
195 

Gin Ser Ala Leu 
210 

Leu Leu Glu Phe 

225 

Leu Tyr Lys 



85 

Arg Thr He Phe 



Val Lys Phe Glu 
120 

He Asp Phe Lys 
135 

Asn Tyr Asn Ser 
150 

Gly lie Lys Val 
165 

Val Gin Leu Ala 



Pro Val Leu Leu 
200 

Ser Lys Asp Pro 
215 

Val Thr Ala Ala 
230 



90 



Phe Lys Asp Asp 
105 



Gly Asp Thr Leu 



Glu Asp Gly Asn 
140 

His Asn Val Tyr 
155 

Asn Phe Lys He 
170 

Asp His Tyr Gin 
185 

Pro Asp Asn His 



Asn Glu Lys Arg 
220 

Gly lie Thr His 
235 



95 



Gly Asn Tyr Lys 
110 

Val Asn Arg He 
125 

He Leu Gly His 



He Met Ala Asp 
160 



Arg His Asn He 
17 5 

Gin Asn Thr Pro 
190 

Tyr Leu Ser Thr 
205 

Asp His Met Val 



Gly Met Asp Glu 
240 



<210> 93 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ser(TCC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 93 

atg tec tec tec tec tec age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 
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gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

8 5 * 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thf Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 " 135 140 



432 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 

225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 94 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 94 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
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65 70 

Asp His Met Lys Gin His 
85 

Tyr Val Gin Glu Arg Thr 
100 

Thr Arg Ala Glu Val Lys 
115 

Glu Leu Lys Gly lie Asp 
130 

Lys Leu Glu Tyr Asn Tyr 
145 150 

Lys Gin Lys Asn Gly lie 
165 

Glu Asp Gly Ser Val Gin 
180 

lie Gly Asp Gly Pro Val 
195 

Gin Ser Ala Leu Ser Lys 
210 

Leu Leu Glu Phe Val Thr 
225 230 

Leu Tyr Lys 



<210> 95 

<211> 732 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Ser (TCG) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 95 

atg teg teg teg teg teg age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 14 4 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 



75 

Asp Phe Phe Lys Ser 
90 

lie Phe Phe Lys Asp 
105 

Phe Glu Gly Asp Thr 
120 

Phe Lys Glu Asp Gly 
135 

Asn Ser His Asn Val 
155 

Lys Val Asn Phe Lys 
170 

Leu Ala Asp His Tyr 
185 

Leu Leu Pro Asp Asn 
200 

Asp Pro Asn Glu Lys 
215 

Ala Ala Gly He Thr 
235 



80 

Ala Met Pro Glu Gly 
95 



Asp Gly Asn Tyr Lys 
110 

Leu Val Asn Arg He 
125 

Asn He Leu Gly His 
140 

Tyr He Met Ala Asp 
1 60 



He Arg His Asn He 
175 

Gin Gin Asn Thr Pro 
190 

His Tyr Leu Ser Thr 
205 

Arg Asp His Met Val 
220 

His Gly Met Asp Glu 
240 



ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 
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ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 " 17 0 " 17 5 



528 



gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 96 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 96 

Met Ser Ser Ser Ser Ser Ser Lys 
1 5 

Val Pro lie Leu Val Glu Leu Asp 
20 

Ser Val Ser Gly Glu Gly Glu Gly 
35 40 



Gly Glu Glu Leu Phe Thr Gly Val 

10 15 

Gly Asp Val Asn Gly His Lys Phe 
25 30 

Asp Ala Thr Tyr Gly Lys Leu Thr 
4 5 



Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 



WO 00/42215 
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50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 " 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 ' * 135 ~ 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 97 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ser(TCT)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 97 

atg tct tct tct tct tct age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 .10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc ace 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 4 0 4 5 



ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 



192 



WO 00/42215 
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Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

50 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 110 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 ' * 135 140 



240 



288 



336 



acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 



432 



aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 

145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 

165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 

225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 98 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 98 

Met Ser Ser Ser Ser Ser Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 



WO 00/42215 



- ci - 



PCT/AU00/00008 



35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 " 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 ~ 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 



<210> 99 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Thr (ACA) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 99 

atg aca aca aca aca aca age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Thr Thr Thr Thr Thr Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gec aca tac gga aag etc acc 144 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
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35 40 45 

ctg aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 " 55 60 

ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 240 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 288 
Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

tat gtg cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 336 
Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 384 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 432 
Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

aag ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 480 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 528 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

gag gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 576 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 ~ 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 732 
Leu Tyr Lys 



<210> 100 
<211> 243 
<212> PRT 

<213> Artificial Sequence 
<400> 100 

Met Thr Thr Thr Thr Thr Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 



Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
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20 25 30 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 80 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 95 

Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 110 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 
115 120 125 

Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 
130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 160 

Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 170 175 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

Leu Tyr Lys 

<210> 101 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Thr(ACC)5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 

<400> 101 

atg acc acc acc acc acc age aag ggc gag gaa ctg ttc act ggc gtg 48 

Met Thr Thr Thr Thr Thr Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
15 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 96 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 
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a a ss ss as a 2" ss E - - a a; a as a 

S K S 52 c?1 S £ SB 3! ^ S S3 S 8 S S 

50 55 

a ?3 ss s ss s= - w » s 1 s ~ s ~ i 

65 70 

K S S3 S 1 S K S2 S i K K 52 S B 85 
- 0 B S S £ S i !S K BS W S§ S » 

y 100 1Ub 

s a ss a 53 a a a 25 a a a a ss a s 

115 120 

... „„ aaa aa t qqa aac att etc ggc cac 
ga g ctg aag ggc att gac ttt aag gag gat ^ ^ ^ ^ &y ^ 
Glu Leu Lys Gly He Asp rne l4Q 
130 1JD 

a a 2- a a a a a £ ?3 - - - - i 
a a a a a a a a a a a a a a a a 
a a a a a a a a a a a a a a a a 
a a, a a a a a a a a a a a a a a 
a a a a a a a a a a a a a a a a 

210 215 

a a a a a a a a a a a a a a a a 



225 

ctg tac aag tga 
Leu Tyr Lys 



144 



192 



240 



288 



336 



38 4 



432 



480 



528 



57 6 



624 



672 



720 



732 



<210> 102 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



Me? 0 ?hr°Thr Thr Thr Thr Ser Lys Gly Glu Glu Leu Phe Thr Gly V.1 
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10 15 



, t Wa i n» Teu Asp Gly Asp Val Asn Gly His Lys Phe 
Val Pro He Leu Val Glu Leu asp uiy h 3Q 

20 2b 
Ser val S.r Gly Glu Gly Glu Gly »p Ala Thr Tyr Gly Lys Leu Thr 

lau Lys L lie Cys Thr Thr Gly Lys Leu Pro val Pro Trp Pro Thr 

50 55 
L eu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

Z His Het Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 

Tyr val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 10b 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Ar g He 



85 

A 

100 

Val Lys Phe 
115 120 
,u Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly Hi, 



Glu Leu Lys Gly ue asp "« — — - 

130 1 - 3b 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

Zl Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

180 iy:> 
Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 
Le u Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 
Leu Tyr Lys 



<210> 103 
<2H> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 



<"£ Description of Artificial Sequence: Thr (ACG) 5GFP 



<220> 

<221> CDS 

<222> (1) . • (732) 



Ss a a s s s » s as k s k s sg sa 



48 



1 5 
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att etc qtq gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
fal Pro He leu Va! Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 



a ttt 96 



20 



f-r-i- ate aac aaa gag ggt gaa ggt gat gec aca tac gga aag etc acc 
Ser Val ler Gly SS Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

4 0 4 ^ 



35 



4-+-^ a tr fn C acc act qqa aag etc cct gtg cca tgg cca aca 
III Lys Phe fie C?s Thr Thr gy Lys Leu Pro Val Pro Trp Pro Thr 



50 



ofrr- act- acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Ml Ser Tyr Sy.Sa? Girl Cys Phe Ser Arg Tyr Pro 

65 70 75 

aac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 
!sp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 

i-at ata caq gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin GlS Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 

acc cqc qct gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg III Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 

135 140 



130 



aaa eta qaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu Tlu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
y 15Q 155 idu 



145 



195 



cag tct gec ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

215 220 



210 



r+a eta aaa ttt qtq acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu 111 Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 



225 

ctg tac aag tga 
Leu Tyr Lys 



<210> 104 
<211> 243 
<212> PRT 

<213> Artificial Seguence 



230 235 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 1 

gag gat gga tee gtg cag ctg gee gac eat tat caa cag aac act cca 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 19° 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tee acc 624 
fie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 



672 



720 



732 
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S:S°£hi°lhr Thr Thr Thr Ser Lys Gly Glu Gl« Leu Phe Thr Gly V.l 
1 5 10 

Val Pro lie Leu Val Glu Leu Asp Gly Asp V.l Asn Gly His Lys Phe 

20 25 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 



35 



40 



Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 



Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 yu 

Tyr val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 10b 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 

115 120 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His 

130 135 I 40 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 



lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 



230 235 



Leu Tyr Lys 



<210> 105 
<211> "732 
<212> DNA 

<213> Artificial Seguence 

^223> Description of Artificial Sequence: Thr (ACT ) 5GFP 

<220> 

<221> CDS 

<222> (1) • - (732) 

ftg^c^act act act act age aag ggc gag gaa ctg ttc act ggc gtg 48 



PCT/AUOO/00008 

WO 00/42215 _ C viii - 



Met Thr Thr Thr Thr Thr Ser Lys CI, 81» 01. Leu Phe Thr «, V.l 

« s s s a e s e s§ s a - ® 1 - - 

20 

ss & ss ifv ss ss s: S5 sx s ss s a; s - - 
a i" s s s: ss £ » » £ s a s s s s 

50 55 

a s £ s sk s E ffi a a S| ss s k - i 

65 70 

s s a » sa ss k ss k jg s s s e sc 
s ?s s a s s ss ss i s: e s « s s a 

J 100 iu::> 

S2 5 Ej StS !S IS SK K © C S 3 g - 

SS S S SB S5 S i K S S SB K S £ S5 S 

130 135 

tj| ss s: s - i - ~ - £ g £ JS m« SS K 
S s s £■ § s - «s SS £ "! S K s i s 

B C K S S S S K i SS £ S S S 3£ SS 

180 185 

;S l?y Bj 115 E S2 £ £ ?" K £ SS fyj 2S SS S 
S 2 S S SS S HI S iS K S! i C = s ? " 

s a s £ a s e c ss s s s ss s s sa 

o o n zoo 



225 230 

ctg tac aag tga 
Leu Tyr Lys 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



732 



<210> 106 



WO 00/42215 
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<211> 243 
<212> PRT 

<213> Artificial Sequence 



^Thrlhr Thr Thr Thr Ser Lys Gly Glu Giu Leu *~ ^15 ^ 

va! Pro lie Leu Val Glu Leu Asp Gly Asp VI Asn 61, Hi, Lys Ph. 

20 zr> 
S.r val Ser Gly Glu Ql, Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

L eu Lys Pne ... =.s Thr Th. Gly Lys Leu Pro v.l Pro Trp Pro Thr 

50 bb 
L «u Val Thr Thr Ph. S.r Tyr Gly Va! Gin Gys Phe S.r Ar, Tyr Pro 

«p His ».t Lys Gin Hi. Asp Ph. Ph. Lys S.r Ala M.t Pro Glu Gly 

Tyt val Gin Glu Arg Thr II. Ph. Phe Lys Asp Asp Gly Asn Tyr hys 
y ioo 105 

Thr Arg Ala Glu val Lys Ph. Glu Gly Asp Thr Lou V.l Asn Ar, He 
115 12U 

- Q rlv Ile Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
Glu Leu Lys Gly He Asp rnc y 14Q 

130 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 

Ly s Gin Lys AS„ Gly H« Lys VaL Asn Ph. Lys II. Ar g His Asn lie 

Glu Asp Gly S.r val Gin L.u Ala Asp His Tyr Gin Gin Asn Thr Pro 

IX. Gly Asp Ty Pro val «u L.u Pro Asp Asn His Tyr Leu S.r Thr 

195 20U 
«1» S.r Ala h.u S.r I.ys Asp Pro Asn Glu Lys Arg A,P His «« v.l 

210 21b 
L .„ Leu Glu Phe Val Thr Ala Ala Gly II. Thr Hia Gly «.t Aap Glu 



225 
Leu Tyr Lys 



230 



<210> 107 
<211> "732 
<212> DNA 

<213> Artificial Sequence 



<220> . f . n nf Artificial Sequence: Trp (TGG ) 5GFP 

<223> Description of Artiricidi oc M 



<220> 

<221> CDS 

<222> (1) • - (732) 



WO 00/42215 
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aJq 0 tqg°tgg tgg tgg tgg age aag ggc gag gaa ctg ttc act ggc gtg 48 
Me? Trp Trl Trp Tr| Trp Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1*5 10 L:> 



qtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct ate aqc gga gag ggt gaa ggt gat gee aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

eta aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu Lys pL He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

eta qtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 7 0 75 80 

aac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 
Asp nil Mel Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 



85 



tat qtq cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tvr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cgc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 140 

aaa ctq qaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
III III Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp 
145 150 155 ibu 

aaa caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He 
165 170 

aaa aat aaa tec qtq cag ctg gec -gac cat tat caa cag aac act cca 
Glu ?sp GlJ Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gec ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
2io 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



732 



WO 00/42215 



- cxi - 



PCT/AUOO/00008 



<210> 108 
<211> 243 
<212> PRT 

<213> Artificial Sequence 

Me?^rp°?rp Trp Trp Trp Ser Lys Gly Glu Glu Leu Phe Thr Gly V.l 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 



20 



25 



Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 5 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 



85 



Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

y 100 105 11 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 

130 135 140 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 



180 



He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 

Leu Tyr Lys 



<210> 109 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<2 20> 

<223> Description of Artificial Sequence: Tyr (TAT) 5GFP 



WO 00/42215 



- cxii - 



PCT/AU00/00008 



<220> 

<221> CDS 

<222> (1) • • ("732) 



S~ £ - BI IB S « ffi B K S BI S 1 « 
S» IB JU BI S2 S: BI IB IB IB 82 BI 1?? SB BB IB 

20 25 

BI BI BI IB SB SB IB SS| IB K S £ IB - - - 
Su 5: S2 IB IB BI £ I?y BI BI IB ?2 IB IB IB B* 

50 55 

ffl BI S BI IB Br BI SB v2 sin IB IB IB IB IB IB 

"7 0 

S IB B? BI S IB IB IB IB BI BUB B? IB IB IB 

8 5 

IB BI IB SB BB BB Bl IB IB I" IB Bp IB BI IB BB 

100 105 

SS BB BI SB BI B! IB SB IB IB BB IB B? BB !S BI 

115 120 

=100 naa aat qqa aac att etc ggc cac 

1!? IB BB IB Si Bp IB BI SB L P SB «. .1. — 

130 135 

aa g ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gec gac 

Lvs Leu Glu Tyr Asn Tyr Asn Ser His flsn vajL yx 

145 150 155 

IB B! BI IB BI BI BB BI IB BI 52 S BB IB BI 

165 170 

SB up IB, IB B? IB IB BI IB IB IB IB IB IB BI IB 

180 18b 

SI IB BI IB BI BI IB IB IB IB BI BI IB IB IB BI 

195 200 

SS IB IB IB IB BI IB IB BI IB BI IB IB SB B? BI 

215 " u 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



57 6 



624 



672 



210 

m*ata acc act get ggg ate aca cat ggc atg gac gag 

III ill HI Phe fal Tnr La La IVy He Thr His Gly Met Asp Glu 
225 230 235 



720 



WO 00/42215 



- cxiii - 



PCT/AUOO/00008 



732 



ctg tac aag tga 
Leu Tyr Lys 



<210> HO 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



t$%?iyr Tyr Tyr Tyr S.r Lys Gly Glu «~ Val 

V,! Pro lie Leu V.I CI. Leu Asp Gly »P V.I «» Gly His Lys Phe 

Ser V.l S.r Gly Glu Gly Glu Gly ASP Al. Thr Tyr Gly Lys Leu Thr 

L eu Lys Phi He Gys T h r Thr Gly Lys Leu Pro V.I Pro Trp Pro Thr 

50 " 
Leu val Thr Thr Ph. Ser Tyr Gly V.1 Gin Cys Phe Ser Arc, Tyr Pro 

Z His «er Lys Gin His A,p Ph. Ph. Lys Ser AX. «« Pro Glu GXy 

Tyr v.l Gin Glu Arg Thr XI. Ph. Phe Lys Asp Asp Gly £. Tyr Lys 

100 lu;> 
Thr Ar, AX, Glu v.l Lys Phe Glu Gly Asp Thr L.u v.l Asn Ar g IX. 

Glu Leu Lys GXy IX. Asp Ph. Lys GXu Asp GXy Asn IX. L.u Gly His 

130 135 
L ys Leu GXu Tyr Asn Tyr Asn Ser His Asn v.l Tyr He Met Al. Asp 

r. Gin Lys Asn Gly lie Lys V.l Asn Ph. Lys II. Ar, His As ? n lie 

CLu asp Gly Ser v.l Gin Leu Al. Asp His Tyr Gin Gin Asn Thr Pro 

180 iyb 
U. Gly AS 9 P Gly Pro V.l Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

om Ser «. Leu Ser Lys Asp Pro Asn Glu Lys Aro Asp His Mer v.l 
210 

L eu Leu Glu Phe Val Thr Ma Ma Gly He Thr His Gly Met Asp Glu 



225 230 
Leu Tyr Lys 



<210> HI 
<211> 732 
<212> DNA 
<213> Artificial 



Sequence 



WO 00/42215 



- cxiv - 



PCT/AU00/00008 



< 2 2 0> 

<223> Description of Artificial Sequence: Tyr (TAC) 5GFP 

<220> 

<221> CDS 

<222> (1).. - (732) 

a^tac'tac tac tac tac age aag ggc gag gaa ctg ttc act ggc gtg 48 
Met Tyr Tyr Tyr Tyr Tyr Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 1° " 

ate cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 

25 JU 



20 



tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

4 0 4b 



35 



rta aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu lys Phe ill Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

55 60 



50 



ctq Qtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
70 "75 BU 



65 



gac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 
Lp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 



85 



tat at a caq gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gil Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

105 11U 



100 



acc cqc act gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Thr A?g Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 

120 125 



115 



aaa eta aaa ggc att gac ttt aag gag gat gga aac att etc ggc cac 
III Leu Lys Gly He Lp Phe Lys Glu Asp Gly Asn He Leu Gly His 



130 



135 140 



aaa ctq gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lys Leu G?u Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 



145 



150 155 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
y 165 170 I 75 

aaa qat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

IS Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 



180 



ate qqc qac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
!95 200 205 

cag tct gec ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



67 2 



PCT/AU00/00008 

WO 00/42215 _ cxv _ 



^ n rta oao ttt qtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu SIS Phe Vl Thr Si. Ala Gly He Thr His Gly Met Asp Glu 



720 

His Gly Met Asp Glu 

225 230 235 

732 

ctg tac aag tga 
Leu Tyr Lys 



<210> 112 
<211> 243 
<212> PRT 

<213> Artificial Sequence 

Sl?°iyi"yr Tyr Tyr Tyr Ser Lys Gly Glu Glu Leu Phe Thr Gly VI 
1 5 10 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 " 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 I 20 125 

Glu Leu Lys Gly. He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 135 I 40 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 15 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 1 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 20 => 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 

Leu Tyr Lys 



<210> 113 
<211> 732 



WO 00/42215 



- cxvi - 



PCT/AUOO/00008 



<212> DNA 

<213> Artificial Sequence 

tllll Description of Artificial Sequence: Val(GTA)5GFP 

<220> 

<221> CDS 

<222> (1) . • (732) 

<400> 113 . ^ „ a o o^a eta ttc act ggc gtg 48 



KS SS S3 « SS S 85 K 1" S - - 1 « 

ss ss ss ss s? ss 2! ss i?| ss si ss s? ss ss ss 

20 25 

ss ss s a; ss a; a: sp ss s ss ss ss ss - ss 
a a ss ss ss ss ss ss a ss ss si a s P ss ss 

50 55 

a ss ss ss a ss ss is si ss ss ss ss ss ss s 240 
sp ss a 5: ss ss tap ss ss ss ss in ss sss ss ss 
- si ss ss s ss is ss ss ss is ss ss ss ss si 

100 105 

£ S, SS SS SS SS SS SS SIS SS S SS SI SS SS SS 



115 



ss ss ss ss sss ss ss ss ss ss ss ss SS SS SIS sis 

130 135 

ss ss sss ss ss ss ss SSS SSS SS SI ss ss 52 ss ss 

il* 150 I 55 



96 



144 



192 



288 



336 



384 



432 



480 



528 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
165 ± / u 

gag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Tnr Pro 

ate qqc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 624 
lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 
cag tct gec ctg tct aaa gat ece aac gaa aag aga gac cac atg gtc 672 



PCT/AU00/00008 

WO 00/42215 _ cxv ii - 



G1 „ Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys «9 «P -1* ™ 

s s ss ss a s fi e ss s; s ss e k ss as 720 



225 230 

ctg tac aag tga 
Leu Tyr Lys 



<210> 114 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



732 



^"val'val val val val Ser Lys Gly Glu Glu Leu Ph. Thr Gly Val 

V.l Pro lie Leu val Glu Leu Asp Gly Asp val Asn Gly His Lys Ph. 

20 2b 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

L eu Lys Phe He Cys Thr Thr Gly Lys Leu Pro v.l Pro Trp Pro Thr 

L eu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Are Tyr Pro 
65 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
8 5 

Tyr val Gin Glu Ar g Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 

100 105 
Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 

115 12U 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 

130 135 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 

lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

210 215 

L eu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
9-»n 



225 230 
Leu Tyr Lys 



WO 00/42215 



- cxviii - 



PCT/AUOO/00008 



<210> 115 
<211> "732 
<212> DNA 

<213> Artificial Sequence 



J«£ Description of Artificial Sequence: Val (GTC) 5GFP 



<220> 

<221> CDS 

<222> (1) - - (732) 



<400> 115 — „_ ttc act ggc gtg 

12 SI VS VS SS VS IS SI SS SS SK SI «» % ^ 
SS Si Si ?a? 1" SS SS SS & & - S5 SZ SI Si 

20 2b 

S !S SK SS SK SS IS IS SU K Si Si IS SS Si Si 

za Si s! is Si si si ss si ss si si si ss sr. s? 

50 55 

tn rs ss KI Si Si Si is si SK Si Si Si is Si Si 

65 70 

K IS K SI SfS Si IS Si Si SI Si Si S S5 SS SS 

Si si is ss s si si si si si si ss ss si Si si 

100 105 

SI IS Hi is SI si SI IS ss IS Si SI SI Si is is 

115 120 

III si si ss is IS si si ss IS ss is Si Si ss is 

130 135 

!g 3 SS Si IS Si IS Si K SI SI Si S S3 K g 

•ii SK si is S?i SI SI SI Si si si si is is is rs 

165 

ss is ss si sa sk si is is is Si ss ss is si si 

180 iab 

si SS is ss Si SI Si Si IS IS Si SS Si SI Si Si 



96 



WO 00/42215 



- cxix - 



PCT/AU00/00008 



s ss s a s s g ss ss a - i - = s 55 
s s s s a s s ss « - 1 - 85 s s S 120 

225 230 732 



ctg tac aag tga 
Leu Tyr Lys 



<210> 116 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



Met^Val^Val Vai Val V,! S.r Lys GXy Glu Glu I- ^15 
Val Pro U. Leu Val .1. Leu «P «g — — «* "S ^ 

ser V,! ser Gly Glu Gly Glu Gly Asp 1. Thr Tyr Gly Lys Leu Thr. 

L .u Lys Te ,1. Gys Thr THr Gly Lys Le» Pro Val Pro Trp Pro Thr 

50 " 
L e„ val Thr Thr Phe Ser Tyr Gly Val Gln Gys Ph. Ser Ar g Tyr Pro 

Z His Mat Lys Gln His Asp Phe PH. Lys Sar Ala « Pro Glu Gly 

Tyr val Gin C1..W Thr He PH. PH. Lys ASP A,P Gly Asn Tyr Lys 
y 100 

Thr «, Ala Glu vai Lys PH. Glu Gly Asp Thr Leu val ASH «, XL. 
115 iZU 

asd Phe Lys Glu Asp Gly Asn He Leu Gly Hxs 
Glu Leu Lys Gly He Asp pne uy* «- 14Q 

130 1J:> 

Hys Leu Glu Tyr Asn Tyr Asn Ser His Asn val Tyr lie «et Ala »P 

nys Gin Lys Asn Gly He Lys val Asn ft. Lys XI. Are His Asn II. 

Glu MP Gly Ser Ti Gin Leu ,1a Asp His Tyr Gin Gin Asn Thr Pro 

n. Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

Gln ser Ala Leu Ser Lys Asp Pro Asn Glu Lys «, Asp His MeS val 

210 " 3 
L .„ Leu Glu Phe val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 



225 230 



WO 00/42215 



- cxx - 



PCT/AU00/00008 



Leu Tyr Lys 



<210> 117 
<211> "732 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Val(GTG)5GFP 



<220> 

<221> CDS 

<222> (1) . . (732) 



•is? s? sa si si sn ^ as ss ss ssi ss ss ss si 

1 5 10 

?a ss si ss si i" 21 ss ss ss si ss s? ss si ss 



20 25 30 

£ SI IS 85 SS SS SS SS IS K SS SS 25 5? S? 

35 40 4D 

ctg'aaa ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 



21 SS £ SS SS SS 5S 15 21 52 S5 SS SS 15 SS SS 



65 



IS S'S S? 51 SS SS IS SS SS 51 SS IS £? SS IS S5 



85 ' . 9° 



SS SI S2 IS 15 SS SS SS SS 51 IS IS S? SS. SS 51 



100 



SS 15 12 SS SS 51 SSI SS 15 IS SS 21 5? SS SS 51 



IS 21 51 15 Si IS SS 51 SS IS 15 Sj Si 21 SS SS 
51 21 SS SS SI Sr, SI SS SS Si SI SS SS S? IS IS 

150 155 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 170 

gag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 
111 Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 185 



96 



144 



192 



240 



288 



336 



ate 384 



432 



480 



528 



576 



WO 00/42215 



- cxxi - 



PCT/AUOO/00008 



*i-n nnr aac nqc cct gtg etc etc cca gac aac cat tac ctg tec ace 624 
Tie ITy Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 20o 

caq tct gec ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 672 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu GlS Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 



ctg tac aag tga 
Leu Tyr Lys 



<210> 118 
<211> 243 
<212> PRT 

<213> Artificial Sequence 



732 



Me^Val'val Val Val Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 
1 5 10 13 

Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 



20 



Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 

Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 60 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 90 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
- - 105 11U 



100 



Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 

130 i35 14 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 1 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 110 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

185 x y u 



180 



He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

!95 200 205 

Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val. 

210 215 220 



WO 00/42215 



- cxxii - 



PCT/AUOO/00008 



Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 

230 z:> ° 



225 

Leu Tyr Lys 



<210> 119 
<211> "?32 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Val (GTT) 5GFP 

<220> 

<221> CDS 

<222> (1) . - ("732) 

<400> 119 aa ttc act qqc qtq 48 



qss ?s e e & bi sb sh sb k s £ sb a 

?B S BS SS sr. 2S Sp SB SS S3 BS 2?y SS BS BS 

20 25 

S2 BS BS SB SB SB SB © K BS S BS IB SS BS - 

35 40 *° 

ss bs ss bs bs ss sb bs bs « sa is sj - ss 

50 55 
ctg gtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 
Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 75 

it; s: SB BS ss bs is BS BS SI SS BS s? SS IS SB 

8 5 

BS 55! SS S!S K SS BS SS SS BS SS IS SB IS Br BS 

100 105 

SS SS S| SB SS BS BS SB S?5 SB SB SS 32 X 5 S IS 
IB SS BS SB ?B IB BS Bl SB Bp IB BS BS BS SB SS 

130 135 i4U 

BI SS SB BS BS B' BS BS SS BS Bl BS SS SB IB BJ 

145 150 

Bl SB B! BS SB SS B! B! BS BS Bl BS SS SB BS SS 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



nnlA „ t e PCT/AUO0/OO0O8 
WO 00/42215 _ cxxiii _ 



trr ota caa ctq qcc gac cat tat caa cag aac act cca 
III Hi Ser ?2 G^n 2S Si. Asp His Tyr Gin Gin Asn Thr Pro 

180 185 

in m i !?v I" 53 E 2S s S*p - - Sj 2! SS ?s 
ss sn K £2 £ °" & SS KS is i £ SS S? SK 

210 215 220 

rr*r, ttt dtfl acc act get ggg ate aca cat ggc atg gac gag 
III 2S SS Phe Va? ?hr S5 La S? He Thr His Gly Met Asp Glu 
225 230 235 

ctg tac aag tga 
Leu Tyr Lys 

<210> 120 
<211> 243 
<212> PRT 

<213> Artificial Sequence 

M^°?af Sal Val Val Val Ser Lys Gly Glu Glu Leu Phe Thr Gly VI 

1 ^ 

Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 

Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 

35 40 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 70 ^ 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 
85 .90 ' J 

Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 
115 120 125 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 

130 135 
Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He 
165 l" 70 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 



576 



624 



672 



720 



732 



180 185 

195 



lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

200 205 



WO 00/42215 



- cxxiv - 



PCT/AUOO/00008 



t oil q« Lvs Asp Pro Asn Glu Lys Arg Asp His Met Val 
Gin Ser Ala Leu Ser Lys asp r 22Q 

210 2 " 
L «„ Leu Glu Phe ».l Thr Al, Ala Gly He TAr His Gly Met Asp Glu 
225 230 
Leu Tyr Lys 



<210> 121 
<211> "732 
<212> DNA 

<213> Artificial Sequence 



ZVZ ^intion of Artificial Sequence: Stop (TAA) 5GFP 



<223> Description 

<220> 

<221> CDS 

<222> (1) • . (732) 



<400> 121 oarT aaa ct q ttc act ggc gtg 48 

at, taa taa taa taa taa age aag ggc gag gaa ctg ^ ^ 

Met 10 lb 

«s s ss i «a k a b is aS « ss a; s= - a 

E SS ffi gS g!u 8S !" If, Up E S SS SS S? - - 

a s s s s s s « » - s ~ «S « s s s 

50 55 

a a a a a ss s s «a s a s: ss s ss s 

65 70 

K S S S S S S S - i S K « SS B 15 

- si a is s s - s i a a aS js i £ 5! 

100 1u:d 

a a is a a a - a is a ss a a =s a s 
a a a ss ss s s is k a ss i ss a ss a 

130 135 

^ tec cac aat gtg tac ate atg gec gac 

aag ctg gaa tac aac tat aac tec cac g y Met Ala Asp 

Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn vai y 16Q 



96 



144 



192 



240 



288 



336 



384 



432 



480 



145 150 



WO 00/42215 



- cxxv - 
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528 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie 
165 17 0 17 5 

gag gat gga tec gtg cag ctg gec gac cat tat caa cag aac act cca 57 6 
Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 

IOC 1 QO 



180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec ace 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 
195 200 205 

cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg ace get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

ctg tac aag tga 
Leu Tyr Lys 



624 



672 



720 



732 



<210> 122 
<211> 732 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Stop (TAG) 5GFP 

<220> 

<221> CDS 

<222> (1) . . (732) 



<400> 122 

atg tag tag tag tag tag age aag ggc gag gaa ctg ttc act ggc gtg 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

1 5 10 15 

gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc ace 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



ctg gtc act ace ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Val Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 70 75 80 

gac cat atg aag cag cat gac ttt ttc aag age gee atg ccc gag ggc 

Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 

85 90 95 



48 



96 



144 



ctg aaa ttc ate tgc ace act gga aag etc cct gtg cca tgg cca aca 192 
Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 "* 55 60 



240 



288 



tat gtg cag gag aga ace ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 



336 



WO 00/42215 
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100 



105 HO 



acc cqc get gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 

Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie 

115 120 125 

gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 

Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 



130 



135 140 



aaq ctg gaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
Lv s Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
145 150 155 160 



aac att 528 
e 



aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac at 

Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn II 

165 1*70 175 

qaq gat gga tec gtg cag ctg gee gac cat tat caa cag aac act cca 

Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 



180 185 190 

ate ggc gac ggc cct gtg etc etc cca gac aac cat tac ctg tec acc 

He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

195 200 205 



384 



432 



480 



576 



624 



672 



cag tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 
210 215 220 

ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 720 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 
225 230 235 240 

7 32 

ctg tac aag tga 
Leu Tyr Lys 

<210> 123 

<211> 732 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Stop (TGA) 5GFP 

<220> 

<221> CDS 

<222> (1) . . ("732) 



<400> 123 ft 

atq tga tga tga tga tga age aag ggc gag gaa ctg ttc act ggc gtg 40 

Me t Ser Lys Gly Glu Glu Leu Phe Thr Gly Val 

15 10 15 



gtc cca att etc gtg gaa ctg gat ggc gat gtg aat ggg cac aaa ttt 
Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe 
20 25 30 

tct gtc age gga gag ggt gaa ggt gat gee aca tac gga aag etc acc 
Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 45 



96 



144 



WO 00/42215 
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ctg aaa 
Leu Lys 
50 



ttc ate tgc acc act gga aag etc cct gtg cca tgg cca aca 

Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

55 60 

ctq qtc act acc ttc tct tat ggc gtg cag tgc ttt tec aga tac cca 

Leu Sal Thr Thr Phe Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro 

65 ™ 75 

aac cat atg aag cag cat gac ttt ttc aag age gec atg ccc gag ggc 

Asp His Me? Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly 



tat qtq cag gag aga acc ate ttt ttc aaa gat gac ggg aac tac aag 
Tyr Val IS Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys 
100 105 HO 

acc cqc qct gaa gtc aag ttc gaa ggt gac acc ctg gtg aat aga ate 
Tnr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 



115 



120 125 



gag ctg aag ggc att gac ttt aag gag gat gga aac att etc ggc cac 
Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His 
130 I 35 140 

aaa eta aaa tac aac tat aac tec cac aat gtg tac ate atg gee gac 
"2 Leu Gil Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp 
14 5 150 155 

aag caa aag aat ggc ate aag gtc aac ttc aag ate aga cac aac att 
Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn lie 
165 170 1 D 

^ = rr o^t* noa tec ata caq ctg gee gac cat tat caa cag aac act cca 
III Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro 
180 I 85 190 

ate aac aac qgc cct gtg etc etc cca gac aac cat tac ctg tec acc 
He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr 

200 205 



195 



caa tct gee ctg tct aaa gat ccc aac gaa aag aga gac cac atg gtc 
Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val 

215 220 



210 



ctg ctg gag ttt gtg acc get get ggg ate aca cat ggc atg gac gag 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu 



225 2 30 

ctg tac aag tga 
Leu Tyr Lys 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



732 



<210> 124 
<211> 717 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP humani 
control 

<220> 

<221> CDS 

<222> (1) . • (717) 



WO 00/42215 



- cxxviii - 



PCT/AU00/00008 



aJg°agc 2 aag ggc gag gaa ctg ttc act ggc gtg gtc cca att etc gtg 48 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

5 10 lb 



1 



qaa ctg gat ggc gat gtg aat ggg cac aaa ttt tct gtc age gga gag 
GlS Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 



20 25 



ggt gaa ggt gat gec aca tac gga aag etc ace ctg aaa ttc ate tgc 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 



35 



40 45 



ace act gga aag- etc cct gtg cca tgg cca aca ctg gtc act acc ttc 
Thr Thr Gly LyI Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 

55 60 



50 



tct tat ggc gtg cag tgc ttt tec aga tac cca gac cat atg aag cag 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 " 70 75 80 

cat gac ttt ttc aag age gee atg ccc gag ggc tat gtg cag gag aga 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

acc ate ttt ttc aaa gat gac ggg aac tac aag acc cgc get gaa gtc 
Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 HO 

aag ttc gaa ggt gac acc ctg gtg aat aga ate gag ctg aag ggc att 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

qac ttt aag gag gat gga aac att etc ggc cac aag ctg gaa tac aac 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

tat aac tec cac aat gtg tac ate atg gee gac aag caa aag aat ggc 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

ate aag gtc aac ttc aag ate aga cac aac att gag gat gga tec gtg 
He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 I 75 

caq ctg gee gac cat tat caa cag aac act cca ate ggc gac ggc cct 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

atq etc etc cca gac aac cat tac ctg tec acc cag tct gee ctg tct 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

aaa gat ccc aac gaa aag aga gac cac atg gtc ctg ctg gag ttt gtg 
Lvs Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

acc get get ggg ate aca cat ggc atg gac gag ctg tac aag tga 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 " 230 235 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



717 



<210> 125 



WO 00/42215 



- cxxix - 
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<211> 238 
<212> PRT 

<213> Artificial Sequence 

M^Ser^ys Gly Glu Glu Leu Phe Thr Gly V.1 V.1 Pro He Leu Val 

1 ^ 



Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

20 25 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 

35 40 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 

50 55 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 

65 70 75 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

115 120 1 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

130 135 
Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 I 70 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 I 85 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

200 



195 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 126 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
< 2 2 0 > 

<223> Description of Artificial Sequence: Ala(GCA)5 
primer 

cggggtacca tggcagcagc agcagcaacjc aagggcgagg aactgttcac tggc 54 



WO 00/42215 



- cxxx - 
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<210> 127 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 



<220> ari-ifirial Sequence: Ala(GCC)5 

<223> Description of Artificial ^M^ c 

primer 

^gggtacca tggccgccgc cgccgccaac aagggcgagg aactgttcac tggc 



<210> 128 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 



Description of Artificial Sequence: Ala(GCG)5 
primer 

cggggtacca tggcggcggc ggcggcga^c aagggcgagg aactgttcac tggc 



<210> 1293 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

tllll Description of Artificial Sequence: Ala (GCT) 5 
primer 

cggggtacca tggctgctgc tgctgctacc aagggcgagg aactgttcac tggc 

<210> 130 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Arg (AGA) 5 
primer 

cggggtacca tgagaagaag aagaagaagc aagggcgagg aactgttcac tggc 54 

<210> 131 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Arg ( AGG) 5 
primer 

<400> 131 aa rtattcac tqgc 54 

cggggtacca tgaggaggag gaggagga^c aagggcgagg aactgttcac gg 



WO 00/42215 



- cxxxi - 
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<210> 132 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Arg (CGA) 5 



r 



prime 

S^cca tgcgacgacg acgacgaasc aa^agg aactgttcac t gg c 

<210> 133 
<211> 54 
<212> DMA 

<213> Artificial Sequence 

<220> • 4--^ of Artificial Sequence: Arg(CGC)5 

<223> Description ot Artiricidi 

primer 

Cggggtacca tgcgccgccg ccgccgca^ aagggcgagg aactgttcac t gg c 

<210> 134 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

^223^ Description of Artificial Sequence: Arg (CGG) 5 
primer 

cggggtacca tgcggcggcg gcggcgga^c aagggcgagg aactgttcac tggc 

<210> 135 

<211> 54 

<212> DNA 

<213> Artificial Sequence 

Description of Artificial Sequence: arg(CGT)5 
primer 

cggggtacca tgcgtcgtcg tcgtcgtasc aagggcgagg aactgttcac tggc 

<210> 136 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



54 



54 



54 



54 



<2^> Description of Artificial Sequence: Asn(AAC)5 
primer 

cggggtacca tgaacaacaa caacaacaoc aagggcgagg aactgttcac tggc 



WO 00/42215 



- cxxxii - 
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<210> 137 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Asn (AAT ) 5 
primer 

<400> 137 

cggggtacca tgaataataa taataatagc aagggcgagg aactgttcac tggc 



54 



<210> 138 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Asp(GAC)5 
primer 

<400> 138 

cggggtacca tggacgacga cqacgacagc aagggcgagg aactgttcac tggc 

<210> 139 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Asp (GAT) 5 
primer 

<400> 139 

cggggtacca tggatgatga tgatgatagc aagggcgagg aactgttcac tggc 



<210> 140 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cys(TGC)5 
primer 

<400> 140 

cggggtacca tgtgctgctg ctqctgc aqc aagggcgagg aactgttcac tggc a* 

<210> 141 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cys(TGT)5 
primer 



<400> 141 



WO 00/42215 
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cggggtacca tgtgttgttg ttgttgtagc aagggcgagg aactgttcac tggc 



54 



<210> 142 
<211> 54 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gln(CAA)5 
primer 

cggggtacca tgcaacaaca acaacaaacrc aagggcgagg aactgttcac tggc 



<210> 143 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gln(CAG)5 
primer 

cggggtacca tgcagcagca gcagcaga^c aagggcgagg aactgttcac tggc 54 



<210> 144 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Glu(GAA)5 
primer 

cggggtacca tggaagaaga agaagaaagc aagggcgagg aactgttcac tggc 54 

<210> 145 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Glu(GAG)5 
primer 

cggggtacca tggaggagga ggaggagao_c aagggcgagg aactgttcac tggc 54 

<210> 146 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<220> ^ n ( „ r7l ., 

<223> Description of Artificial Sequence: Gly (GGA) o 

primer 



WO 00/42215 



- cxxxiv - 
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cggggtacca tgggaggagg aggaggaa^c aagggcgagg aactgttcac tggc 54 

<210> 147 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gly(GGC)5 
primer 

cggggtacca tgggcggcgg cggcggcacfc aagggcgagg aactgttcac tggc 54 



<210> 148 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gly (GGG) 5 
primer 

<400> 148 ^ . 54 

cggggtacca tggggggggg ggggggga^c aagggcgagg aactgttcac tggc 



<210> 149 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Gly (GGT) 5 
primer 

<400> 149 ^ . . _ , rtnr c 4 

cggggtacca tgggtggtgg tggtggta^c aagggcgagg aactgttcac tggc 

<210> 150 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<220> . f „-. c 

<223> Description of Artificial Sequence: His(CAC)5 

primer 

cggggtacca tgcaccacca ccaccacagc aagggcgagg aactgttcac tggc 54 



<210> 151 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Description of Artificial Sequence: His (CAT) 

primer 



WO 00/42215 



- cxxxv - 
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cggggtacca tgcatcatca tcatcata^c aagggcgagg aactgttcac tggc 54 

<210> 152 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: lie (ATA) 5 
primer 

cggggtacca tgataataat aataataagc aagggcgagg aactgttcac tggc 54 

<210> 153 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: lie (ATC) 5 
primer 

cggggtacca tgatcatcat catcatcagc aagggcgagg aactgttcac tggc 54 

<210> 154 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: lie (ATT) 5 
primer 

cggggtacca tgattattat tattattaqc aagggcgagg aactgttcac tggc 54 

<210> 155 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Leu(CTA)5 
primer 

<400> 155 t-A 
cggggtacca tgctactact actactaagc aagggcgagg aactgttcac tggc 

<210> 156 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Leu (CTC) 5 



WO 00/42215 
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primer 

cggggtacca tgctcctcct cctcctcagc aagggcgagg aactgttcac tggc 54 



<210> 157 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Leu(CTG)5 
primer 

cggggtacca tgctgctgct gctgctgacc aagggcgagg aactgttcac tggc 54 

<210> 158 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<22 0 > 

<223> Description of Artificial Sequence: Leu(CTT)5 
primer 

cggggtacca tgcttcttct tcttcttagc aagggcgagg aactgttcac tggc 54 

<210> 159 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
< 2 2 0 > 

<223> Description of Artificial Sequence: Leu (TTA) 5 
primer 

cggggtacca tgttattatt attattaa^c aagggcgagg aactgttcac tggc 54 



<210> 160 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Leu (TTG) 5 
primer 

cggggtacca tgttgttgtt gttgttgaac aagggcgagg aactgttcac tggc 54 

<210> 161 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: Lys (AAA) 5 
primer 

cggggtacca tgaaaaaaaa aaaaaaaa^c aagggcgagg aactgttcac tggc 54 

<210> 162 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Lys (AAG) 5 
primer 

cggggtacca tgaagaagaa gaagaaga^c aagggcgagg aactgttcac tggc 54 



<210> 163 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Phe(CTT)5 
primer 

cggggtacca tgcttcttct tcttcttagc aagggcgagg aactgttcac tggc 54 



<210> 164 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Phe(TTC)5 
primer 

cggggtacca tgttcttctt cttcttcagc aagggcgagg aactgttcac tggc 



<210> 165 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: Pro(CCC)5 
primer 

<400> 165 . 
cggggtacca tgcccccccc cccccccaqc aagggcgagg aactgttcac tggc 



<210> 166 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
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<22?> Description of Artificial Sequence: Pro(CCG)5 
primer 

cggggtacca tgccgccgcc gccgccgaac aagggcgagg aactgttcac tggc 

<210> 167 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<223^ Description of Artificial Sequence: Pro(CCT)5 
primer 

cggggtacca tgcctcctcc tcctcctagc aagggcgagg aactgttcac tggc 54 



<210> 168 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 



^223> Description of Artificial Sequence: Pro(CGA)5 
primer 

Cggggtacca tgcgacgacg acgacgaa^c aagggcgagg aactgttcac tggc • 54 



<210> 169 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Ser(AGC)5 
primer 

cggggtacca tgagcagcag cagcagca^c aagggcgagg aactgttcac tggc 54 



<210> 170 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Ser (AGT) 5 
primer 

cggggtacca tgagtagtag tagtagtagc aagggcgagg aactgttcac tggc 54 



<210> 171 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Ser(TCA)5 
primer 

<400> 171 

cggggtacca tgtcatcatc atcatca aqc aagggcgagg aactgttcac tggc 54 



<210> 1726 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ser(TCC)5 
primer 

<400> 172 

cggggtacca tgtcctcctc ctcctcc agc aagggcgagg aactgttcac tggc 54 



<210> 173 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ser(TCG)5 
primer 

<400> 173 

cggggtacca tgtcgtcgtc gtcgtcg agc aagggcgagg aactgttcac tggc 54 



<210> 174 
<211> 54 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Ser(TCT)5 
primer 

<400> 174 

cggggtacca tgtcttcttc ttcttct agc aagggcgagg aactgttcac tggc 54 



<210> 175 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:. Thr (ACA) 5 
primer 

<400> 175 

cggggtacca tgacaacaac aacaaca agc aagggcgagg aactgttcac tggc 54 



<210> 176 
<211> 54 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Thr(ACC)5 
primer 

<400> 176 

cggggtacca tgaccaccac caccacc agc aagggcgagg aactgttcac tggc 54 

<210> 177 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Thr (ACG) 5 
primer 

<400> 177 . 

cggggtacca tgacgacgac gacgacg aqc aagggcgagg aactgttcac tggc 54 

<210> 178 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Thr (ACT) 5 
primer 

<400> 178 

cggggtacca tgactactac tactact agc aagggcgagg aactgttcac tggc 54 

<210> 179 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Trp (TGG) 5 
primer 

<400> 179 

cggggtacca tgtggtggtg gtggtgg aqc aagggcgagg aactgttcac tggc 54 

<210> 180 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Tyr (TAT) 5 
primer 

<400> 180 

cggggtacca tgtattatta ttattat agc aagggcgagg aactgttcac tggc 54 



<210> 181 
<211> 54 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Val(GTA)5 
primer 

<400> 181 

cggggtacca tggtagtagt agtagta agc aagggcgagg aactgttcac tggc 54 



<210> 182 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Val(GTC)5 
primer 

<400> 182 

cggggtacca tggtcgtcgt cgtcgtc agc aagggcgagg aactgttcac tggc 54 



<210> 183 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Val(GTG)5 
primer. 

<400> 183 

cggggtacca tggtggtggt ggtggtgagc aagggcgagg aactgttcac tggc 54 



<210> 184 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Val(GTT)5 
primer 

<400> 184 

cggggtacca tggttgttgt tgttgtt agc aagggcgagg aactgttcac tggc 54 



<210> 185 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 3 f 
oligonucleotide common primer 



<400> 185 

ccggaattc t ca cttgtaca ggtggtccat gcc 
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